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The  field  of  image  processing  is  very  vast  and  has  grown  considerably 
during  the  past  decade.  The  development  of  ever  faster  and  smaller  processors, 

I 

increasing  resolution  and  on-chip  processing  sensors,  larger  capacity  mass 
memories,  wider  bandwidth  communication  channels  and  more  sophisticated  digital 
processing  stations  have  provided  equipment  devices,  and  techniques  that  five 
years  ago  were  beyond  expectation.  Image  processing  has  found  a significant  role 
in  scientific,  industrial,  biomedical,  space  and  government  applications.  Being 
a vast  field,  image  processing  can  be  divided  into  different  subfields  relating 
to  different  aspects.  Image  transforms,  Image  coding,  Image  enhancement,  Image 
restoration,  feature  extraction  and  image  understanding  are  all  different 

aspects  of  image  processing. 

« » 

This  report  is  devoted  to  one  of  the  aspects  of  image  processing,  namely 
the  investigation  of  linear  transformations  for  automatic  cartographic  analysis 
and  feature  extraction.  Aerial  photographs  are  pictures  taken  from  some  dis- 
tance above  the  earth.  They  usually  contain  images  of  a variety  of  objects. 

These  objects  have  different  characteristics  and  so  can  be  classified  according 
to  some  specific  properties.  For  example  roads  and  rivers  can  be  put  into  one 
group,  since  on  a picture  they  usually  appear  as  a line.  After  this  categoriza- 
tion has  been  done,  an  attempt  can  be  made  to  enhance  some  of  these  features 
automatically  by  the  aid  of  some  mathematical  transforms.  The  actual  recogni- 
tion of  specific  cartographically  important  objects  requires  a non-linear  detec- 
tion process.  The  object  recognition  process  is  not  addressed  in  this  picture. 

A picture  is  a flat  object  whose  appearance  varies  from  one  point  to 
another  point.  In  a " black  and  white  " picture  this  variation  can  be  described 
by  a single  parameter  corresponding  to  the  total  amount  of  light  reaching  the 


observer  from  the  given  point.  In  accordance  with  the  above  remarks,  we  can  de- 
fine a picture  as  a two-dimensional  light  intensity  function  f(x,y),  where  x 
and  y denote  spatial  coordinates  and  the  value  of  f at  any  point  (x,y)  is  pro- 
portional to  the  brightness  (gray  level)  of  the  picture  at  that  point.  The  pic- 
ture, therefore,  is  a non-coherent  image  of  the  object. 

A digital  picture  is  a picture  f(x,y)  which  has  been  discretized  both  in 
spatial  coordinates  and  in  brightness.  In  this  content  a digital  picture  can  be 
shown  by  a matrix  whose  row  and  column  indices  identify  a point  in  the  picture 
and  the  corresponding  matrix  element  value  identifies  the  gray  level  at  that 
point.  The  element  of  such  a digital  array  are  called  picture  elements  or 
pixels. 


A picture  transform  takes  a picture  from  the  spatial  domain  into  another 
functional  domain.  The  recognition  of  some  cartographic  features  may  be 
facilitated  by  using  such  picture  transforms.  For  this  purpose  instead  of  sear- 
ching for  an  object  in  the  original  image,  its  transform  is  taken  and  it  is 
hoped  that  certain  salient  features  or  patterns  will  be  exhanced.  The  transform 
selected  should  have  a strong  response  to  the  characteristic  features  of  in  - 
terest  while  being  invariant  to  miscellaneous  parameters  (such  as  size,  trans- 
lation , etc. ) and  insensitive  to  the  general  background  which  is  of  little  or  no 
interest.  Since  different  mathematical  transforms  have  different  properties,  it 
will  be  interesting  to  investigate  their  properties  and  see  how  they  act  on 
different  functions.  For  example,  the  much  used  Fourier  transform,  in  which 
sines  and  cosines  are  the  kernel  functions,  brings  out  periodic  structures  of 
the  image  quite  well.  However,  the  Fourier  transform  is  not  optimal  or  even 
suitable  for  all  image  features.  Circular  objects  are  best  identified  with  the 
Bessel  transform  for  example,  while  other  features  may  be  more  efficiently 


•j 

identified  with  the  Walsh  or  Haar  transforms.  The  task  is  to  determine  which 

• j 

transform,  if  any,  is  best  suited  for  identifying  a given  terrain  features.  The 
paramount  consideration  in  selecting  a transform  shall  be  its  success  in  identi- 
fying a given  object  but  other  factors  will  be  considered,  such  as  execution 
speed  with  digital  and  analog  processors. 

To  follow  the  pattern  of  the  above  discussion,  tills  paper  is  comprised  of 
six  chapters . 

Chapter  one,  the  present  chapter,  contains  some  introductory  discussions. 

At  the  end  of  this  chapter,  the  objects  usually  present  in  an  aerial  imagery 
are  classified. 


Chapter  two  gives  a review  of  continuous  transformations.  Continuous  trans- 
formations apply  to  continuously  dense  picture  elements  and  supply  a rich  re- 
pertoire of  analytic  techniques  and  results.  Fourier  transform,  Hankel  trans- 
form, Matched  filtering  process  and  some  novel  extrapolation  techniques  are 
discussed  in  section  A of  this  chapter.  In  section  B of  chapter  two  some  candi- 
date features  from  the  objects  usually  present  in  photographs  are  chosen  and 
used  to  illustrate  the  application  of  the  theory.  The  transformations  discussed 
in  section  A are  applied  to  those  candidate  features  to  enhance  the  characteris- 
tics defined  by  their  analytical  properties.  Some  examples  of  two-dimensional 
matched  filters  are  also  given  in  section  B of  chapter  two. 

One  continuous  transform,  the  Karhunen-  Lo^ve  transform,  was  studied  in 
interim  report  # 4.  This  transform  deals  with  the  statistics  of  an  image  rather 
than  its  exact  shape  or  geometry.  This  transformation  has  been  mainly  proposed 


w 


for  other  uses  in  image  processing  rather  than  feature  extraction.  Since  the  re- 
sults of  our  study  on  the  Karhunen  - Ldeve  transform  didn't  look  promising,  it 
is  not  included  in  this  report. 

Chapter  three  follows  the  same  pattern  of  chapter  two  but  for  discrete 
transformations.  A review  of  the  analytical  properties  of  Walsh-Hadamard  trans- 
form (WHT) , Haar  transform  (HT) , Slant  transform  (ST) , and  Discrete-cosine 
transform  (DCT)  is  given  in  section  A of  this  chapter.  In  section  B some  exam- 
ples with  potential  cartographic  applications  are  examined  . 

In  looking  for  an  object  feature  image  in  a given  picture,  there  is  not 
usually  any  a priori  information  about  the  orientation  or  the  size  of  the  object. 
For  this  reason,  it  is  very  desirable  to  see  how  the  result  of  a transform  is 
affected  by  translation,  rotation  or  scaling  of  the  original  object  Image.  In 
chapters  two  and  three  we  have  tried  to  consider  and  show  those  effects  for 
different  transforms.  This  is  done  both  analytically  and  by  example. 

In  chapter  four,  a software  package  is  developed  for  the  evaluation  of  two- 
dimensional  (WHT) , (HT),  (ST) , and  (DCT)  in  processing  the  data  pixels.  The 
entire  program  is  written  in  Fortran  IV  language.  This  package  is  used  for 
various  examples  of  cartographic  features  and  the  results  are  given  in  Appen- 
dix III. 

In  chapter  five,  a study  of  hardware  feasibility  of  a signal  processing 
system  is  made.  In  this  chapter,  a special  purpose  signal  processor  for  carto- 


graphic studies  is  proposed.  The  processor  makes  use  of  the  two-dimensional 
Hadamard/Haar/Slant/Discrete  Cosine  transforms.  An  assessment  is  made  of  the 
speed  and  of  the  cost. 


In  chapter  six,  some  conclusions  and  recommendations  have  been  made  based 
upon  our  transforms  study  in  the  first  five  chapters.  Those  conclusions  and  re- 
commendations are  merely  based  on  the  analytical  properties  of  the  transforms 
studied  or  on  the  examples  used  to  illustrate  those  properties.  The  conclusions 
Eire  at  best,  estimates  and  should  not  be  taken  as  the  last  word.  It  is  very 
possible  that  a deeper  look  into  the  subject  may  contradict  some  of  our  conclu- 
sions. The  above  remarks  should  be  taken  as  a caveat  to  the  reader. 

Thus  our  interpretations  and  value  judgements,  while  given  in  order  to 
share  the  insight  we  gained  while  carrying  out  these  studies,  must  be  tempered 
with  the  recognition  that  they  are  based  on  a limited  analysis  and  a still  in- 
complete understanding  of  the  picture  interpretive  process. 

GLASSIFICATION  OF  OBJECTS  USUALLY  PRESENT  IN  AERIAL  IMAGERY 

In  order  to  maximize  the  likelihood  of  success  in  using  linear  transforms, 
it  will  be  necessary  to  classify  the  principal  attributes  of  particular  carto- 
graphically  interesting  objects.  This  problem  can  be  possibly  approached  from 
different  ways.  A good  choice  is  to  categorize  the  cartographic  objects  acc- 
ording to  the  dimensionality  of  their  images.  While  the  image  of  a road  appears 
as  a line,  the  image  of  a lake  covers  some  area  of  the  picture.  So  according  to 
the  feature  dimension,  the  features  that  are  usually  present  in  an  aerial  carto- 
graph  can  be  divided  into  three  different  groups,  point  features,  line  features 
and  area  features. 

The  distinction  between  points,  lines  and  areas  depends  on  i (l)  the  dis- 
tance from  which  the  picture  has  been  taken.  (2)  the  degree  of  spatial  discre- 
tization of  the  picture.  For  example,  the  image  of  a storage  container  for  oil 


will  appear  as  a point  if  the  picture  is  taken  from  a far  distance.  However,  a 
picture  of  the  same  scene  taken  from  a closer  distance  can  have  the  image  of  the 
container  shown  as  a disk.  On  the  other  hand,  suppose  we  discretize  an  8"  X 8" 
picture  into  an  8 X 8 square  array  (see  chapter  three),  then  a circle  of  radius 
0.5  " can  appear  only  as  a point  or  not  appear  at  all.  However,  if  the  picture 
is  discretized  into  an  80  X 80  square  array,  the  same  circle  will  cover  some 
area  of  the  digital  picture.  So  we  should  have  in  mind  that  when  we  compare  the 
features  according  to  their  dimensionalities,  the  comparison  will  be  true  only 
for  features  present  in  the  same  picture  or  for  pictures  taken  from  almost  the 
same  distance  and  with  the  same  degree  of  discretization. 

POIOT  FEATURES 


Images  of  objects  of  small  size  (small  length  and  small  width)  appear  as 
points  on  an  aerial  photograph.  Examples  of  these  objects  are  isolated  build- 
ings, storage  thanks,  tunnel  entrances,  etc.  Point  features  on  sui  image  can  be 
different  in  brightness  suggesting  the  presence  of  different  objects.  A storage 
container  made  of  metal  will  surely  appear  brighter  than  one  made  of  concrete. 
Point  features  may  appear  as  a set  on  a picture  and  the  set  can  have  some  spe- 
cial characteristic.  For  example,  if  a grounded  aircraft  is  taken  as  a point 
feature,  then  one  squadran  of  grounded  aircrafts  will  exhibit  point  periodici- 
ties on  the  image.  Another  example  can  be  the  storage  containers  for  oil  and 
gas  which  usually  have  a clustering  symmetry  among  themselves. 


LINE  FEATURES 


Any  object  whose  length  is  much  longer  than  its  width  will  appear  as  a line 


on  an  aerial  cartograph.  Rivers,  roads,  transmission  lines,  bridges,  etc.  axe 
all  examples  of  the  so-called  line  features.  While  some  line  features  like  a dam 
appear  as  a segment,  others  like  roads  or  rivers  may  extend  through  the  whole 
picture.  The  line  features  are  usually  different  in  characteristics,  for  exam- 
ple, while  runways  are  invariably  straight  and  frequently  have  intersections, 
roads  and  rivers  usually  are  expected  *o  have  unbounded  extent  and  will  likely 
exhibit  parallelism  and  meandering  symmetries,  respectively. 

There  is  a quite  nice  distinction  between  the  line  features  that  are  man- 
made and  those  which  are  natural.  The  former  one  usually  shows  some  kind  of  re- 
gularity or  straightness  while  the  latter  one  is  not  so  neat.  For  example,  if  a 
pipeline  is  to  be  extended  between  two  points  on  a rather  flat  surface,  it  will 
certainly  be  built  as  a straight  line,  but  a river  between  those  two  points  can 
follow  some  other  arbitrary  direction.  Another  property  which  can  be  only  re- 
lated to  those  man-made  line  features  is  the  possibility  of  having  a set  of  them 
with  some  special  properties  among  the  members.  For  example,  parallelism  is  a 
characteristic  of  some  man-made  line  features  like  dual  highways  or  transmission 
lines . 

AREA  FEATURES 


The  objects  whose  images  cover  a relatively  large  portion  of  a picture  are 
considered  as  area  features.  These  features  can  have  any  arbitrary  shape  depend- 
ing on  the  nature  of  their  construction.  A lake,  a cemetery,  an  urban  area,  etc., 
are  all  examples  of  area  features.  Shape,  size,  and  convexity  are  the  features 
that  make  a distinction  between  different  area  featwes.  Neat  geometrical  shape 
or  symmetry  is  usually  the  characteristic  of  man-made  areas.  For  example,  a 
field  can  show  rectangular  symmetry  while  a large  container  may  show  circular 


symmetry.  Most  of  man-made  axea  features  have  some  internal  properties  if  exa- 
mined in  close  detail.  For  example,  towns  will  have  internal  straight  line 
parallelism  due  to  streets  and  point  periodicities  due  to  housing  developments. 

A list  of  the  objects  usually  present  on  cartographs  is  given  on  the  follow- 
ing page. 


CARTOGRAPHIC  FEATURES 


POINT  FEATURES 

Isolated  buildings 
Storage  tanks 
Quarry  or  borrow  pit 
Tunnel  entrance 

LINE  FEATURES 


Rivers  and  streams  with  water 
Drainage  channels  without  water 
Canals 

Dual  highways 

Primary  roads 

Secondary  roads 

Unpaved  roads  and  trails 

Transmission  lines 

Pipe  lines 

Levees 

Dam 

Rapids  and  falls 
Bridges 

Shoreline  (large  water  body) 
Airport 
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In  this  chapter,  some  two-dimensional  continuous  linear  transformations 
will  be  studied.  Continuous  transformations  are  those  transformations  that  apply 
to  continuously  dense  picture  elements.  Since  a picture  is  a function  of  two 
real  variables,  so  all  the  transforms  used  in  picture  processing  will  be  two 
dimensional.  In  general,  a two-dimensional  continuous  transform  is  given  by, 

-foo 

F(£,i?)  « jfjfK(x,y,^,T7)  f(x,y)  dx  dy 
— oo 

in  which  f(x,y)  is  the  input  function  in  spatial  domain,  f(£,  V ) is  the  func- 
tion in  the  transformed  domain  and  K(x,y,£,  V)  is  the  kernel  of  the  transform. 

The  integral  is  taken  over  the  whole  domain  in  which  the  function  exists.  Fourier 
and  Hankel  transforms,  discussed  in  this  chapter,  are  examples  of  two-dimensional 
continuous  transforms.  In  section  2A,  the  analytical  properties  of  these  trans- 
forms have  been  investigated  and  in  section  2B  some  examples  have  been  used  to 
illustrate  their  properties. 

In  the  second  part  of  this  chapter,  the  two-dimensioanl  matched  filtering 
process  is  discussed.  Matched  filters  are  used  to  detect  object  images  in  a pic- 
ture. Although  matched  filtering  can  be  considered  as  a linear  transform,  there 
is  a significant  difference  between  matched  filtering  and  transforms.  In  a linear 
transform,  the  kernel  of  the  transform  is  fixed  and  unique  for  any  input  func- 
tion. In  matched  filtering,  the  kernel  is  matched  to  the  object  to  be  detected 
(which  needs  to  be  known),  and  so  varies  from  one  case  to  another.  Analytical 
properties  of  the  two-dimensional  matched  filter  have  been  discussed  in  section 
2A  along  with  some  examples  in  section  2B.  Some  novel  ideas  for  allowing  the 
matched  filter  to  be  used  in  cases  when  the  object  is  not  exactly  known  are  also 


(13) 


discussed. 


In  the  last  part  of  this  chapter,  some  new  extrapolation  techniques  are 
discussed.  It  has  been  shown  that  in  the  special  case  of  bandlimited  signals, 
one  can  extrapolate  a two-dimensional  function  in  terms  of  a known  part.  It  is 
suggested  that  this  technique  may  be  used  to  exhance  a hidden  part  of  a picture 
vising  the  information  conveyed  by  the  rest  of  that  picture. 


The  main  purpose  of  studying  continuous  transforms  in  this  chapter  is  to 
see  if  they  can  be  used  in  cartography.  Some  transforms  act  better  on  a parti- 
cular feature  than  others.  This  behaviour  of  a transform  is  usually  manifested 
by  the  characteristics  of  its  kernel.  In  this  chapter,  we  will  try  to  show  which 
transform  is  most  suitable  for  extracting  a particular  feature. 

In  extracting  a feature  from  aerial  imagery,  we  usually  look  for  the  image 
of  an  object  when  there  is  no  a priori  information  about  its  orientation  or 
size.  For  this  reason,  it  will  be  instructive  to  see  how  a particular  transform 
behaves,  as  an  object  image  is  translated,  rotated  or  scaled.  In  our  study  of 
different  transforms  in  this  chapter,  a try  has  been  made  to  answer  the  above 
question. 

2A  » THEORY 

FOURIER  TRANSFORM 

In  Fourier  transform,  sines  and  cosines  are  the  kernel  functions.  The  most 
suitable  transform  for  extracting  a particular  feature,  is  a transform  whose 
kernels  are  as  closely  matched  to  that  feature  as  possible.  Since  sines  and 
cosines  are  periodic  functions .even  before  going  into  details  of  analytical  pro- 
perties of  Fourier  transform,  we  can  predict  that  this  transform  will  act  quite 
well  on  periodic  structures  in  a photograph.  Parallel  streets  in  an  aerial  pic- 
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ture  of  an  urban  area,  is  an  example  of  such  periodicity. 


4 


Let  f(x,y)be  a function  of  two  independent  variables  x and  y,  then  its 
Fourier  transform  F(u,v)  is  defined  by, 


4-co 


F(u.v)  = J /f(x,y)  exp[-j2ll(ux+  vy)]  dx  dy 


(2A-1) 


— oo 


The  inverse  Fourier  transform  of  F(u,v)  is  given  by, 


+ oo 


f(x,y)  = //F(u,v)  exp  [ II  (ux  + vy)  ] du  dv 


(2A-2) 


— oo 


The  pair  of  functions  f(x,y)  and  F(u,v)  related  by  equations  (2A-1)  and  (2a -2) 
will  be  denoted  by 


f(x,y)^=^F(u,v) 


(2A-3) 


The  Fourier  transform  of  f(x,y)  may  not  exist  unless  f satisfies  certain  condi- 
tions [l]  . The  main  condition  for  f(x,y)  is  to  be  continuous  and  square  inte- 
grable. 

The  Fourier  transform  of  a real  function  is  generally  complex. 


F(u,v)  = R(u,v)  + j l(u, v) 


= | F(u,v) j exp[ j<t(u,v)] 


(2A-4) 

(2A-5) 


where, 


F(u,v)  = [ R2(u,v)  + I2(u,v)] 


1/2 


(2A-6) 
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Suppose  we  have, 


f(x,y)<=>F(u,v) 

then 

f(x  -xQI  y - y0)<=>F(u,v)exp[-j2n  (uxQ  + vyQ)  ] (2A-9 

0 

It  is  interesting  to  note  from  equation  (2A-9)  that  a shift  in  f(x,y)  does  not 
affect  the  magnitude  of  its  Fourier  transform, 

|f(u,v)  exp[-J2lJ  (uxQ  + vyQ)]|  = |f(u,v)|  (2A-H 

This  is  a very  good  result  since  visual  examination  of  the  transform  is  usually 
limited  to  a display  of  its  magnitude  and  so  Fourier  spectrum  possess  transla- 
tional invariance. 

If  we  introduce  polar  coordinates, 

(16) 
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x = r QosQ 
y = r Sin0 


u = w Cos  <t> 
v = w Sin0 


(2A-11) 


and  if  we  have 


f(x,y)  F(u,v) 


then 


f(r,0)<=>F(w,*) 


(2A-12) 


Now  if  the  coordinates  of  f(r, G ) are  rotated  by  the  amount  G0  it  can  be  shown 
that, 


f(r,0+  e0)*=*F(w  ,<t>+Gn) 


(2A-13) 


The  above  result  is  very  interesting  and  important  for  cartography  since  it 
shows  that  if  f(x,y)  is  rotated  by  an  angle  0o  , then  F(u,v)  is  rotated  by  the 
same  angle. 


Some  other  useful  properties  of  the  Fourier  transform  pair  are, 


a f(x,y)^  a F(u,v) 


(linearity)  fj(x,y)  + f^x.y)^  Fjfu.v)  + F2(ufv) 


(2A-14) 

(2A-15) 


and, 


(scaling)  f(ax,by)43^>  ~ab  - F(u/a  , v/b) 


(2A-16) 


(1?) 
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From  equation  (2A-16) , one  can  find  the  Fourier  transform  of  a scaled  version  of 
the  function  f(x,y),  [see  appendix  I]  as, 


1 

f(fcx,  ky)4==^— p-  F 

k 


(2A-17) 


The  above  relation  shows  that  the  Fourier  transform  of  a scaled  version  of  a fun- 
tion  is  a scaled  version  of  the  Fourier  transform  of  the  function  multiplied  by 
the  squared  inverse  of  the  scaling  factor.  As  a result  the  shape  is  preserved. 


For  a more  detailed  study  of  Fourier  transform,  see  (l),  (2),  and  (3). 


HANKEL  TRANSFORM 


One  of  the  features  usually  present  on  an  aerial  photograph  is  the  storage 
container.  A single  storage  container  appears  as  a centered  circle  or  as  a disk. 
Storage  containers  can  be  located  as  a cluster.  Sometimes  this  cluster  is  cir- 
cularly symmetric.  The  Hankel  transform  discussed  in  this  section  appears  very 
useful  in  bringing  out  the  object  images  with  circular  symmetry.  This  property 
is  based  on  the  presence  of  the  Bessel  function  as  the  kernel  of  Hankel  trans- 
form. 

Given  a function  f(r)  and  a real  constant  w,  we  form  the  integral, 
oo 

F(w)  = y*r  f (r)  Jq(wt)  dr  (2A-18) 

0 

This  integral  defines  the  Hankel  transform  F(w)  of  f(r).  In  equation  (2A-18), 
Jq(x)  is  the  Bessel  function  of  order  zero  as  ahown  in  Fig.  (2A-1) 


If  a two-dimensional  signal  has  circular  symmetry  as,  e.g.  a circle,  then 


Its  Fourier  transform  reduces  to  a Hankel  transform.  So  the  theory  of  Hankel 
transforms  can  be  studied  as  a special  case  of  the  corresponding  theory  of 
Fourier  transforms. 

Suppose, 

f(x,y)  = f(^x2  + y2)  (2A-19) 

then  with, 

V2  2 

x + y 

(2A-20) 

I 2 2 

u = h Cos  ^ v = w Sin  <J>  w = ^|  u + v^ 
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we  have, 

ux  + vy  = wr  Cos  ( d - <p)  dx  dy  ■ r dr  dJ 

Inserting  the  transformations  into  the  integral  form  of  Fourier  transform  which 
defines  F(u,v),  we  will  have, 


F(u,v)  = J rf(r)  J e~ jwr  Cos(®"^d*  dr  = 2ujrf{r)  JQ(wr)  dr  (2A-21) 

O - n 0 

= 2llF(w) 

It  can  be  shown  that  the  inversion  formula  for  Hankel  transform  is, 


f(r)  = J*  F(w)  JQ(rw)  dw  (2A-22) 


By  looking  at  the  Hankel  transform  as 

(20) 

a special  case  of  a two-dimensional 
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Fourier  transform,  all  of  its  properties  can  be  derived  from  the  corresponding 
properties  of  Fourier  transform.  The  most  important  observable  fact  is  that  F(w) 
has  circular  symmetry.  This  means  that  the  Fourier/Hankel  transform  of  a cir- 
cularly symmetric  function  will  be  circularly  symmetric.  Reference  (2)  gives  a 
very  detailed  study  of  Hankel  transforms. 

MATCHED  FILTERING 

One  of  the  major  problems  in  communications  engineering  is  to  detect  a sig- 
nal which  has  been  degraded  by  noise.  The  matched-filter  implemented  by  electri- 
cal circuits,  is  widely  used  as  a means  of  one-dimensional  signal  detection.  This 
filter  is  optimum  in  minimizing  the  signal -to -noise  ratio  (SNR)  and  has  been 
studied  in  detail  (4-5) . 

The  two-dimensional  version  of  the  matched  filter  which  is  a straight  for- 
ward extension  of  the  one-dimensional  case  can  be  used  for  detecting  objects 
within  images. 

Let  f(x,y)  be  the  desired  object  to  be  detected.  Since  practically  any 
image  is  accompanied  by  some  kind  of  noise,  on  observation  the  image  contain- 
ing the  object  f(x,y)  can  be  shown  as  a sum  of  two  parts.  The  first  part  is  the 
object  itself,  and  the  second  part  is  some  additive  noise.  If  we  show  the  observ- 
ed image  by  g(x,y)  and  the  noise  by  n(x,y)  we  have, 

g(x,y)  = f (x,y)  + n(x,y)  (2A-23) 

The  noise  n(x,y)  can  be  of  any  kind,  it  may  be  the  background  of  a picture 
containing  the  desired  object,  or  it  can  be  the  noise  of  the  communication 

\ 
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channel  over  which  the  image  has  been  transmitted  or  both.  The  assumption  about 
the  additive  nature  of  the  noise  is  usually  met  in  practice  with  good  approxima- 
tions since  no  detailed  statistical  assumption  such  as  Gaussianess  is  necessary. 


To  detect  f(x,y)  from  g(xfy),  we  pass  g(x,y)  through  a linear  filter  whose 
impulse  response  (point  spread  function  in  optics)  is  h(x,y),  and  we  try  to  find 
h(x,y)  in  such  a way  that  the  signal-to-noise  ratio  would  be  maximum  at  the  out- 
put of  this  filter. 

If  the  signal  g(x,y)  is  given  to  the  input  of  a filter  whose  inpulse  res- 
ponse is  h(x,y),  the  output  will  be, 

g(x,y)  * h(x,y)  = f(x,y)  * h(x,y)  + n(x,y)  * h(x,y)  (2A-24) 

in  which  * denotes  the  convolution  operation.  This  is  shown  graphically  in 
Fig.  (2A-2)  . 
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The  instantaneous  signal  power  at  the  filter  output  is  given  by, 

S = |f(x,y)  * h(x,y)|2  (2A-25) 

+ oo  2 

« I /jfr(u.v)  H(u,v)  exp[j(u|  + W )]  du  dv  j (2A-26) 


-oo 


F(u,v)  and  H(u,v)  are  respectively  the  Fourier  transforms  of  f(x,y)  and  h(x,y). 
The  noise  power  at  the  filter  output  is,  (6) 


N = j H(u'v)|2  du  dv 

(22) 


(2A-2?) 


— oo 


in  which  Sn(u,v)  is  the  power  spectrum  of  n(x,y)  . The  signal-to-nolse  ratio  at 
the  filter  output  is, 

+ °° 


_S_ 

N 


H(u, v)  exp  [j(u£  + vt))]  du  dv 


— oo 


jJ$n  (u,v)|H(u,v)|  “ du  dv 

— oo 


using  the  Schwartz  inequality  (5),  we  find  the  s/N  is  maximum  for 


H(u,v)  = 


F (u,v)  exp  [-j(u£  + vi7)] 



If  the  noise  is  white, 


Nr 


and 


S„(u.v)  = / 


H(U ,V ) = F*(u,v)  exp  [-j(u£  + vi))] 

W0 


or 


h(x,y)  = f(f-x,  T)  -y) 

N0 


(2A-28) 


(2A-29) 


(2A-30) 


(2A-31) 


(2A-32) 


As  we  see  in  the  case  of  additive  white  noise,  the  optimum  filter  impulse  res- 

Q 

ponse  is  an  amplitude  scaled  version  of  the  signal  image  field  rotated  by  180 
and  so  the  term  matched  filter  is  used. 


For  the  case  of  colored  noise,  the  impulse  response  of  the  matched  filter  was 

given  by  Bq.  (2A-29).  This  is  equivalent  to  the  white  noise  matched  filter  pro 

< 

ceded  by  a whitening  filter  of  system  response  proportional  to  the  inverse  of 
the  noise  spectrum.  For  more  on  this  subject  see  (7). 


The  matched  filter  output  is  given  by, 


S0(x,y)  = g(x,y)  * h(x,y) 

+ ao 

= jjela.p)  h(x  -a,  y - g)  da  d(3  (2A-33) 

-CD 

+ O0 

= J/g(o  .0)  f(^+a  - X,v  +($-  y)  da  dp  (2A-34) 

0 _ cc 

Looking  at  Eq.(2A-26),  we  see  that  the  matched  filter  output  was  maximized  at 
the  point  ( f ,*>  ).  At  this  point  the  output  is, 

+ 03 

XT*  a-H  ) ) da  dp  (2a -35) 

0 -CO 

As  we  see  at  this  point,  the  matched  filter  output  is  simply  the  correla- 
tion between  the  image  and  the  desired  object  to  be  detected.  The  point  ( f , V ) 
is  called  the  matched  filter  offset  . Ordinarily,  the  parameters  (f  , V)  of  the 
matched  filter  transfer  function  axe  set  to  zero  so  that  the  origin  of  the  out- 
put plane  becomes  the  point  of  no  translational  offset  between  g(x,y)  and  f(x,y). 

In  using  matched  filters  for  feature  extraction,  and  specially  if  the  out- 
put is  going  to  be  judged  by  visual  examination,  it  is  always  advantageous  to 
apply  thresholding  to  the  filter  output.  This  action  eliminates  the  possibility 
of  " false  detection  ” or  " miss  " tyr  giving  a bright  spot  in  a dark  background 
where  the  desired  object  is  present  on  the  input  image. 

Examples  of  candidate  objects  to  be  detected  along  with  the  matched  filter 
transfer  functions  which  are  matched  for  detecting  those  objects  and  the  corres- 
ponding outputs  are  shown  in  Fig. (2B-4) . 


SOME  PROPERTIES  OF  THE  TWO-DIMENSIONAL  MATCHED  FILTER  i 


Suppose  that  a matched  filter  is  matched  to  the  signal  f(x,y)  and  its 
offset  has  been  chosen  to  be  the  origin  of  the  output  plane.  Then, 


h(x,y)  = -tj-  f(-x,-y) 
N0 


(2A-36) 


and  the  matched  filter  output  will  be, 


T OO 

gQ(x,y)  = -§-  JJf( a,j8)  f(fl  - x,jj-  y)  dad^ 

Q__  oo 
+ oo 

+ JJn(  a,{3)  f(a-  x,  /?-  y)  da  d/3 


n 

U — oo 


(2A-37) 


in  which  the  SNR  peak  occurs  at  x = 0 and  y = 0.  Now  if  the  matched  filter  in- 
put contains  a translated  version  of  the  object  image,  we  have, 


g(x,y)  = f (x  + A x,  y+Ay)  + n(x,y) 


(2A-38) 


and  the  matched  filter  output  is, 


g0(x,y)  = -§-  j[jff(a+Ax,  13+  Ay)  f(a  - x,/?  - y)  da  d/3 


-§—  JJn(x  ,y)  f(  a - x,  y)  da  d(3 
0 — oo 


(2A-39) 


In  this  case,  as  seen  from  Eq.  (2A-39).  the  SNR  peak  occurs  at  x =Ax 
and  y =Ay,  thus  indicating  the  translation  of  the  input  image  relative  to  the 
reference  image.  Hence  the  matched  filter  is  translation  invariant  and  any 
point  (f,V)  at  which  the  correlation  peak  occurs  in  the  output  plane  describes 
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the  position  of  the  detected  two-dimensional  image. 

Convolution  of  the  input  signal  with  the  matched  filter  response  searches 
the  entire  image  plane  for  a possible  match  and  so  if  there  are  multiple  tar- 
gets of  the  form, 


£f(x  - A x.  , y - ay.) 
i 1 1 


(2A-40) 


all  of  them  will  be  detected  by  use  of  the  same  matched  filter.  Fig.(2B-5)  gives 
an  illustration  of  the  above  fact. 


) 
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The  matched  filter  is  not  rotation  invariant  [Interim  Rep.  #3].  However 
there  may  be  rotated  versions  of  an  object  image  In  a photograph.  We  like  to 
know  how  a rotated  version  of  an  object  image  can  be  detected  by  the  same  match- 
ed filter  . Suppose  that  a matched  filter  is  matched  to  the  signal  f ( p , q ) 
(polar  coordinates).  A rotated  version  of  this  signal  which  is  represented  by 
f(  P»  6 cannot  be  detected  by  the  same  matched  filter.  The  only  way  to  get 

a peak  is  by  rotating  the  matched  filter  itself.  The  impulse  response  of  the 
rotated  matched  filter  will  be, 


h’(  p,  e ■/>  = h (p  ,0+  <f>) 

= f(-p.  e +<t>) 


(2A-41) 


Consequently,  if  it  is  also  desired  to  detect  rotated  versions  of  f ( p , g)  , the 

impulse  response  of  the  filter  should  be  rotatable.  The  value  of  <f>  in  Eq.(2A-Ul) 
is  varied  until  a maximum  for  SNR  is  obtained  Fig.(2A-3) 
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Fig 


(2A-3)  A matched  filter  to  detect  the 
rotated  version  of  an  object. 
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At  this  point,  let's  investigate  how  a matched  filter  which  is  matched  to 
a signal  f(x,y)  can  be  used  to  detect  a scaled  version  of  f(x,y).  The  size  of 
as  object  image  in  a photograph  depends  on  the  distance  from  which  the  picture 
has  been  taken.  Although  the  shape  of  an  object  may  be  preserved  in  pictures 
taken  from  different  distances  its  size  will  be  different  in  different  pictures. 
For  this  reason  we  are  interested  to  know  how  a matched  filter  can  be  modified 
to  detect  scaled  versions  of  an  object  image. 


In  [Appendix  l],  a scaled  version  of  a function  f(x,y)  has  been  defined 
as  f(kx  , ky) . A matched  filter  whose  impulse  response  is  Riven  by. 


h(x,y,k) 


(2A-42) 


can  detect  scaled  versions  of  the  image  f(x,y).  The  matched  filter  is  built  in 
a way  that  parameter  Jc  can  be  varied.  The  output  is  examined  In  a maximum 
likelihood  sense  and  the  largest  output  is  selected.  Pig.  (2A-4) . 


input  image 


output 


select  the  maximum 


Fig  (2A-4)  A matched  filter  to  detect  the 
scaled  version  of  an  object. 


The  assumption  discussed  in  Appendix  1 , that  the  origin  of  x - y plane, 
should  be  inside  the  curve,  does  not  result  in  loss  of  generality.  Since  the 
matched  filter  is  translation  invariant,  it  detects  the  scaled  version  of 
f(x,y)  even  if  it  does  not  encircle  the  origin. 

A matched  filter  with  two  variable  parameters  can  detect  any  scaled,  ro- 
tated or  translated  version  of  the  image  f(x,y).  The  transfer  function  of  such 
a filter  in  polar  coordinates  will  be, 

h(P.<9  . k ,4>  ) = t(-  fp  , 6 + / ) (2A-43) 


input  image 


Fig,  (2A-5).  A matched  filter  to  detect  the  translated, 
rotated  or  scaled  version  of  an  image. 


The  normal  matched  filter  discussed  in  this  section  has  a basic  limita- 
tion which  is  especially  important  in  feature  extraction.  The  output  of  this 
matched  filter  is  primarily  dependent  on  the  energy  of  the  images  rather  than 
their  spatial  structure.  A hexagonal  object  can  be  easily  mistaken  by  a circle 
of  the  same  brightness  and  the  same  size.  In  general  normal  matched  filter  pro- 
vides relatively  poor  discrimination  between  objects  of  different  shape,  but  of 
similar  size  or  energy  content. 

Suppose  that  in  a given  picture,  we  are  looking  for  a disk  with  radius  A 
and  constant  gray  level  G.  If  the  center  of  this  disk  is  located  at  a point 
( i , V ) , we  have 

f(x,y)  = C for  ^(x  -£)2+  (y  - t,)2<  a (2A-44) 

A normal  matched  filter  designed  to  detect  this  signal,  will  have  the  same 
structure  and  will  give  us  a peak  of  the  amount  C2U  A2  at  the  point  ( f , tj  ) . 

As  we  have  mentioned  before,  matched  filtering  is  usually  followed  by  threshold- 
ing, so  that  the  positions. of  the  peaks  are  detected  more  clearly.  The  above 
result  would  be  excellent  if  in  the  output  picture  (processed  by  the  matched 
filter)  all  other  gray  levels  are  smaller  than  C2riA2  , namely 


2 n . 2 


h(x,y)  * f (x,y)<  C^flA 


for  x 


h-4' 


(2A-45) 


then  thresholding  at  a rate  C2IIa2  - * (small  e ) will  locate  the  center  of  the 
circle.  But  in  a real  picture,  there  may  be  disks  of- .different  gray  levels  and 
different  sizes  and  even  other  configuration  which  after  being  processed  by  the 
matched  filter  in  (2A-44) , would  give  larger  peaks  than  C2I1  A2  . How  can  we  dis- 
criminate among  these  peaks  and  choose  the  one  related  to  our  desired  circle  ? 


' ~ ' -r  . *-  r*t 


Certainly  there  is  not  a unique  technique  to  avoid  all  these  problems,  and  more- 
over some  of  them  are  related  to  the  shortcoming  of  the  normal  matched  filter 
itself. 


i 


To  overcome  the  problem  of  getting  some  peaks  larger  than  the  one  corres- 
ponding to  our  signal,  a normalization  technique  on  the  original  image  will  be 
useful.  Suppose  before  using  the  matched  filter  in  (2A-44),  we  normalize  the 
picture  as  follows, 

|f(x,y)  for  f(x,y)<C 

(2A-46) 

C for  f(x,y)>  C 

The  above  normalization  will  guarantee  that,  no  part  of  the  picture  will 
have  a gray  level  above  C,  and  in  the  output  the  largest  possible:  peak  corres- 
ponding to  our  desired  object.  The  value  of  C can  be  chosen  as  the  average  gray 
level  value  of  the  picture. 

Even  using  the  normalization  technique  in  (2A-46)  one  cannot  be  sure  that 

2 2 

a peak  of  value  C FI  A will  correspond  to  a disk  with  radius  A and  gray  level  C. 


< 


To  overcome  all  the  difficulties,  related  to  the  normal  matched  filter,  one 
may  think  of  applying  an  edge  detection  technique  to  a picture  before  It  would 
be  processed  by  the  matched  filter.  In  this  manner,  the  edge  structure  of  an 
object  will  be  used  for  its  detection. 


Suppose  f(x,y)  is  an  object  to  be  detected,  its  gradient  is  given  by, 


Vf(x,y)  = x + -|£-  y (2A-47) 


the  squared  magnitude  of  this  gradient,  ( gx  ) + (~g^ — ) can  be  used  as  an 

edge  detector.  This  edge  detector  will  .work  excellent  if  the  gray  level  is  con- 
stant across  the  object.  If  F(u,v)  is  the  Fourier  transform  of  f(x,y), 

(2A-48) 


v2A-49) 


(2A-50) 

and  the  desired  matched  filter  will  have  an  impulse  response  given  by, 


then 


f(x,y)  F(u,v) 


f(x,y)  <— ■ ;>  -iu  F(u,v) 


8 x 


8y 


f(x,y)  > -iv  F(u,v)  ] 


and 


/ 8f  \2  / 8f  \2  / 2 2\  „ / \ 

+ (~ g ~)  = ~(u  + v ) F(u,v) 


hg(u.v)  = "(^2  + v2)  Hn(u,v)  (2A-51) 

where  H^(u,v)  is  the  impulse  response  of  the  corresponding  normal  matched  fil- 
ter. Since  matched  filtering  is  usually  done  on  a digital  computer,  the  imple- 
mentation of  an  edge  detecting  matched  filter  will  not  be  much  more  difficult 
than  a normal  matched  filter. 

The  introduction  of  the  gradient  matched  filter  brings  about  another  diffi- 
culty which  was  not  much  severe  in  using  the  normal  matched  filter.  The  problem 
is  that  since  we  do  not  usually  have  any  a priori  information  about  the  orien- 
tation of  the  object  to  be  detected,  we  have  to  build  our  matched  filter  with 
the  capability  of  being  rotated.  This  subject  was  discussed  before.  The  whole 

(33) 


process  is  done  on  a digital  computer  and  the  angle  of  rotation  is  changed. dis- 
cretely until  a large  peak  is  detected. 

Using  a gradient  matched  filter,  the  above  process  becomes  more  difficult, 
because  supposing  that  there  is  a rotated  version  of  the  target  in  a picture 
under  consideration,  it  is  very  improbable  that  it  will  overlap  our  matched  fil- 
ter precisely,  no  matter  how  fine  we  choose  the  steps  in  rotating  the  matched 
filter.  See  Fig.  (2B-7) 

The  above  problem  can  be  somehow  avoided  by  producing  a fuzzy  version  of 
the  matched  filter  response  Fig.  (2B-8) . As  an  example  a fuzzy  version 

of  a ring  with  equation 

f(x,y)  = G 

will  be, 

f(x,y)  = G 

where  e is  a small  number. 

The  above  technique  can  help  to  detect  a rotated  version  of  a target,  even 
if  it  does  not  completely  overlap  our  filter. 

Throughout  this  section,  the  object  to  be  detected  f(x,y)  was  always  assum- 
ed deterministic.  If  the  state  of  f(x,y)  is  only  known  statistically,  the  match- 
ed filtering  concept  can  be  extended  to  the  detection  of  a stochastic  image 
field  in  the  presence  of  noise.  See  [Interim  Rep  # 6]  and  (8). 


2 ? 2 

for  x + y = R 

for  (R  -<r)2<  x2  + y2<  (R  + e )2 


A two-dimensional  matched  filtering  process  can  be  described  in  an  all 


digital  system.  Because  the  computer  has  a tremendous  signal -to -noise  advantage 
over  physical  systems  and  because  there  is  no  need  to  implement  the  filtering 
process  on  energy  sensitive  films,  the  matched  filter  can  simply  be  construct- 
ed with  two-dimensional  Fourier  transformation  and  conjugate  operation  of  the 
signal  to  be  detected.  The  actual  filtering  operation,  then  is  obtained  by 
Fourier  transforming  the  input  plane,  multiplying  by  the  filter  plane,  and  then 
inverse  transforming  the  product. 

The  above  technique  is  not  the  only  method  for  building  a two-dimensional 
matched  filter.  In  fact,  the  idea  of  building  the  matched  filter  optically  is 
more  exciting  than  using  a digital  computer  . For  optical  data  processing 
systems,  implementation  of  a true  matched  filter  can  be  extremely  difficult. 
This  is  because  the  filter  contains  both  modulus  and  phase  information  which 
somehow  must  be  placed  on  some  form  of  light  sensitive  material.  For  more  on 
this  subject  see  [Interim  Rep  # 6]  and  [9  - 10]. 


EXTRAPOLATION  TECHNIQUES 

Suppose  a part  of  a photograph  has  been  destroyed  by  some  unwanted  pheno- 
menon (e.g. noise).  Is  there  any  way  to  reconstruct  the  damaged  part  and  extract 
the  features  hidden  in  that  portion  ? This  question  may  be  answered  by  extra- 
polation of  bandlimited  signals  discussed  in  this  section. 


Very  often  it  is  desirable  to  extrapolate  the  unknown  part  of  a signal  f(t) 
in  terms  of  a known  finite  segment  g(t)  where, 


e(t)  = 


f(t) 

0 


lt|  CT/2 


|t|  > T/2 

(35) 


(2A-52) 


In  general  this  problem  is  not  solvable,  but  if  the  function  f(t)  is  con- 
strained to  be  bandlimited,  there  are  neat  solutions  to  the  problem. 

Let  be  the  Fourier  transform  of  f(t).  If  F(w)  vanishes  for  |a>|>ft 

then  f(t)  is  said  to  be  bandlimited  and  ft  is  called  the  cutoff  angular  frequenc 

An  example  of  f(t)  and  g(t)  is  shown  in  Fig  (2A -6) 


Fig(2A-6)  An  example  of  a bandlimited  signal. 
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As  we  mentioned  earlier,  in  the  case  of  bandlimited  f(t),  there  are  differ- 


ent ways  to  extrapolate  the  unknown  part  of  f(t)  in  terms  of  g(t).  One  could, 
for  example,  calculate  successive  derivitives  of  f(t)  at  some  point  in  the  inter- 
val ( - T/2  , T/2  ) and  form  a Taylor  series  representation  which  would  cover 
everywhere.  But  since  in  practice,  such  a Taylor  series  has  to  be  truncated 
somewhere  along  the  line,  the  resultant  approximation  would  give  a very  poor 
approximation  of  f(t)  specially  for  large  values  of  t,  and  moreover  it  will  not 
be  bandlimited. 

Another  approach  is  to  try  to  find  a complete  set  of  bandlimited  functions 
^(t)  which  are  orthogonal  on  the  Interval  (-T/2,  T/2)  as  well  as  on  the  entire 
real  line.  Using  these  functions,  one  can  expand  g(t)  in  terms  of  ^(t)  in  the 
interval  (-  T/2  , T/2  ),  and  then  for  any  other  t outside  this  interval,  the 
expression  gives  the  value  of  f(t)  at  that  point. 

Fortunately  such  functions  have  been  found  and  are  called  Prolate  Sphe- 
roidal functions.  [Appendix  II]  or  [11  - 12] 

In  terms  of  the  above  functions,  f(t)  can  be  written  as, 


f(t)  = fan  (t) 


(2A-53) 


and  the  coefficients  an  can  be  expressed  in  terms  of  g(t)  and  the  abovo  func- 
tions , 


8(t)  ’ 


(2A-54) 


or 


*n  * / s(t)  *■> (t) 


(2A-55) 


It  follows  that  F(u),  the  Fourier  transform  of  f(t)  is  given  by  [13] 


F(u)  = K | an^”  *>n  (ku) 


(2A-36) 


in  which 


The  idea  of  analytic  continuation  of  a signal  can  be  also  used  in  image  proce- 
ssing [14]. 

An  image  is  a two-dimensional  signal  of  limited  extent, 


f(x,y)  = 0 


|x|  > A or  |y|  > B (2A-57) 


Let  F(u,v)  be  the  Fourier  transform  of  f(x,y), 


' 


+A  +B 


:u,v>  = / /, 


f (x,y)  exp  [-j2n(ux+  vy)  ] dx  dy 


-A  -B 


If  F(u,v)  is  only  known  over  3ome  passband 


(2A-58) 


(2A-59) 


(2A-60) 


r - - 


CO  CO 


F(u,y)  = £ £ *i(j  ^i(u)  *2j  M V u,i 

i = 0 j = 0 

It  is  easy  to  show  that  the  coefficients  a.  . are  given  by, 


+ 1 

2 + 2 


Xli  X2j 


U V 
2 " 2 


(2A-61) 


/A  i(u>  F(u,v)  du  dv  (2A-62) 


the  implicit  dependence  of  * (u)  on  UA  and  of  *Jv)  on  VB  has  not  been  shown 

2 * 2 

For  many  years,  the  classical  Rayleigh  diffraction  limit  of  resolution 
was  considered  as  a theoretical  limit  beyond  which  no  higher  spatial  frequen- 
cies could  be  resolved  (in  optical  system) . But  as  we  see  from  Bq.(2A-6l),  with 
some  a priori  knowledge  (finite  limit  in  space  of  the  original  object)  it  is 
possible  to  go  beyond  the  diffraction  limit. 


While  the  technique  of  analytic  continuation  by  the  above  method  is  valid 
theoretically,  there  are  some  limitations  for  practical  purposes.  First,  in 
Eq  (2A-53),  the  series  should  be  truncated  somewhere,  and  so  the  truncation 
error  exists.  However  since  the  eigenvalues  are  in  a decreasing  order  as  a func- 
tion of  index  i,  [Appendix  II]  the  mean  square  error  is  minimum  in  a Karhunen- 
Loeve  expansion  sense  [Interim  Rep#  3].  Secondly,  since  practically  any  image 
is  accompanied  by  noise,  F(u,v)  in  Eq.  (2A-62)  will  not  be  the  pure  Fourier 
transform  of  f(x,y)  and  so  the  coefficients  a^  in  the  expansion  will  be  degra- 
ded by  noise.  A very  large  image  signal-to-noise  ratio  is  needed  to  permit  con- 
tinuation to  just  a few  spatial  frequencies  beyond  the  diffraction  limit.  The 
computational  difficulty  of  Prolate  Spheroidal  functions  is  another  problem  in 


the  utilization  of  the  discussed  method. 


Another  method  for  extrapolation  of  handlimited  signals,  known  in  a finite 
interval,  has  been  proposed  by  Papoulis  [13].  This  method  builds  up  the  unknown 
part  of  the  signal  by  successive  iterations. 

As  before,  let  g(t)  be  the  known  part  of  the  signal  f(t)  in  the  interval 
(-  T,  T) . f(t)  is  bandlimited  and  we  have, 


F(oo)  = 0 


for  |u|  > ft 


(2A-63) 


The  Fourier  transform  of  g(t)  is  called  G(«*>), 


i * 

G(w)  = /«(«.)  exp  [-jut]  dt 


(2A-64) 


Gq(<u)  is  put  equal  to  G(cu)  and  is  put  to  zero  for  |uj|  ^ il  , the  new  func- 
tion is  called  F^oj).  The  inverse  Fourier  transfor  of  F^w)  is  taken  and  f(t) 
is  formed.  For  |t|  < T,  f^t)  is  replaced  by  g(t)  and  the  new  function  is 
called  g1(t)  and  so  on.  The  n th.  iteration  step  comprises  the  following 

K-it")  l"|s;n 

i • F„<“'  = I H>n  (2A_65) 


+ 1 1 

fn(t)  = y*Fn(w)  exp  [jot]  dw 


(2A-66) 


-3tc  .*• 


*.  i ■ •.  Sr  K» 


Fig.  (2A-7)  shows  a realization  of  the  above  method  using  an  ideal  low-pass 


Fig(2A-7)  The  iterative  method  of  signal  extrapolation 


If  we  show  the  error  at  the  n th  iteration  step  by, 


en(t)  = f(t)  -fn(t) 
it  can  be  shown  that  as  n — ►«> 


(2A-69) 


e (t)  *-  0 (2A-?0) 

n%  ' 

f (t)  — ► f(t)  (2A-71) 

n 

The  number  of  necessary  iterations  for  finding  a good  fit,  depends  on  T and 
increases  as  ftT  decreases.  This  method  as  any  other  method  is  subject  to  some 
errors  which  are  discussed  in  the  original  paper.  Compared  with  the  technique  of 
prolate  Spheroidal  functions,  the  iteration  method  is  more  efficient  in  using 
less  amount  of  storage  space  and  fewer  computations.  By  looking  at  Eq.  (2A-56), 
we  see  that  in  order  to  evaluate  F(o>)  by  the  first  method,  one  must  store  the 
sample  values  of  the  functions  ^(u)  and  use  them  to  evaluate  the  integral 
(2A-55) . In  this  method  no  storage  is  necessary,  and  since  only  discrete  Fourier 
series  are  involved,  the  economy  of  Fast  Fourier  Transform  (FFT)  can  be  used. 


Based  on  the  iteration  technique  for  extrapolation  of  bandlimited  signals, 
smother  technique  has  been  proposed  in  a paper  by  Sabri  and  Steenaart  [15].  In 
this  technique,  the  total  extrapolation  process  is  achieved  by  a single  matrix 
operation.  The  matrix  is  called  the  Extrapolation  Matrix.  The  proposed  techni- 
que and  its  implementation  has  some  advsmtages  over  the  other  extrapolation 
techniques  in  terms  of  the  number  of  computations  and  accuracy  of  results.  Mor- 
ever  it  can  be  implemented  on  a real-time  basis. 


(42) 


The  idea  of  extrapolation  of  bandlimited  signals  in  terms  of  a known  seg- 


ment, has  been  mainly  used  in  image  processing  for  super,  resolution.  However 
since  this  paper  is  devoted  to  cartography  and  the  associated  problem  of  fea- 
ture extraction,  it  will  be  nice  to  see  how  the  extrapolation  technique  can  be 
used  to  enhance  hidden  details  of  an  image.  Suppose  we  have  a picture  which  is 
bandlimited  (in  practice  any  picture  can  be  considered  bandlimited).  A part  of 
this  picture  has  been  degraded  by  noise,  and  unfortunately  this  is  the  part  in 
which  we  are  most  interested.  We  are  looking  for  a scheme  which  can  bring  out 
and  enhance  the  features  hidden  in  that  part.  One  plausible  approach  is  to  use 
the  above  discussed  extrapolation  technique.  Since  the  picture  is  bandlimited, 
we  can  pick  a non-degraded  extent -limited  portion  of  the  picture  and  extrapolate 
the  whole  picture  in  terms  of  that  section.  In  this  way  we  find  a description 
for  the  part  of  the  picture  which  was  hidden  in  noise. 


Usually  the  noise  is  not  limited  to  one  part  of  a picture,  but  is  present 
everywhere  and  will  be  spread  by  extrapolation.  To  avoid  its  effect,  one  can 
use  the  randomness  nature  of  noise  and  perform  the  above  technique  many  times 
using  different  portions  of  the  picture  as  a starting  point.  The  results  are 
averaged  to  give  a good  approximation  of  the  part  of  the  picture  to  be  enhanced. 


2B  - EXAMPLES 


In  section  2A,  the  analytical  properties  of  Fourier  transforms,  Hankel  trans- 
forms and  the  matched  filter  were  investigated.  To  see  how  these  transformations 
act  on  different  patterns,  some  candidate  features  from  the  objects  usually  pre- 
sent in  photographs  will  be  chosen  and  we  will  try  to  find  the  corresponding 
transformed  function  under  the  application  of  different  transforms.  On  this  basis 
we  will  be  able  to  compare  different  transforms  with  respect  to  their  capability 
of  bringing  out  specific  features.  For  Fourier  and  Hankel  transforms  we  can 
usually  get  a closed  analytical  form  for  the  transformed  function  and  the  analy- 
sis will  be  based  on  this  closed  form.  For  matched  filters,  we  like  to  see  how 
the  position  or/and  value  of  the  output  peak  is  affected  by  the  input. 

Aerial  photographs  usually  contain  a variety  of  object  images.  Some  of 
these  objects  have  some  known  geometrical  shape  ( or  at  least  can  be  approxima- 
ted in  that  way),  and  can  be  expressed  by  some  analytical  functions.  However, 
there  are  many  others  which  do  not  possess  such  a property.  Here  we  select  some 
simple  but  common  cartographical  features  and  try  to  find  how  they  are  affected 
by  the  picture  transforms  * 

The  following  patterns  seem  satisfactory  : 

(a)  Slanted  lines 

(b)  Shaded  edges 

(c)  Centered  circles 

(d)  Cluster  of  circles  with  circular  symmetry 


Slanted  lines  are  very  commonly  seen  on  an  aerial  image.  They  can  have 


different  extents  and  may  appear  parallel  to  each  other.  Images  of  roads,  streets 


highway,  bridges,  etc.,  appear  as  slanted  lines  on  a photograph 


Any  object  which  appears  as  a line  on  a picture  taken  from  a far  distance 


may  appear  as  a strip  on  a picture  of  the  same  scene  taken  from  a closer  dis 


tance.  By  a shaded  edge  we  mean  a strip  whose  brightness  gradually  changes  along 


its  width 


Circles  on  a photograph  generally  suggest  the  presence  of  storage  containers 


in  the  scene.  If  there  are  more  than  one  storage  container  on  a spot,  they  usu 
ally  show  circular  symmetry  on  the  picture.  This  suggests  to  see  how  a parties 
lar  transform  affects  the  patterns  of  several  circles  with  circular  symmetry. 


EXAMPLES  FOR  FOURIER  & HANKEL  TRANSFORMS 


The  visual  examination  of  a transform  is  usually  limited  to  a display  of  its 
magnitude.  In  the  Fourier  transform  this  magnitude  is  called  the  Fourier  spectrum 


angle  if  the  original  image  is  rotated  by  the  same  angle  (see  section  2A) 


In  accordance  with  the  above  remark: 


their  simplest  form.  It  will  be  easy  to  visualize  the  transformation  of  their 


In  Fig  (2B-2)  we  have  shown  the  Fourier  transform  of  some  candidate  func 


tions 
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FIG . ( 2 B - 2 ) Candidate  I- unctions  and 
their  Fourier  Transform 


(a)  The  Fourier  transform  of  a line  segment  located  on  the  x axis  is  given 
by  a sine  function  along  the  u axis.  The  period  of  the  zeroes  of  this  function 
depends  on  the  length  of  the  segment  and  gets  smaller  as  the  length  is  increa- 
sed. No  variation  exists  along  the  v axis. 

(b)  Here  we  have  two  parallel  line  segments  seperated  by  a distance  2b.  Its 
Fourier  transform  is  a function  as  in  part  (a)  multiplied  by  a cosine  function 
that  varies  in  a direction  normal  to  the  lines.  The  period  of  the  cosine  func- 
tion is  proportional  to  the  distance  between  the  lines. 

(c)  A strip  of  light  has  been  considered  and  as  we  see  the  transformed 
function  has  variations  in  both  u and  v direction.  The  variations  behave  as 
sine  functions. 


(d)  In  this  pattern  the  brightness  intensity  is  maximum  at  the  center  of 
the  strip  and  is  gradually  decreased  as  we  approach  the  edges.  The  behaviour  of 
the  transformed  function  in  the  u direction  is  the  same  as  in  (a),(b),  and  (c). 
Along  the  v axis  the  function  varies  as  the  square  of  the  sine  function. 

In  Fig  (2B-3)  the  Hankel  transform  of  a circle,  a disk  and  the  Fourier 
transform  of  a cluster  of  circles  with  circular  symmetry  axe  shown. 

(a)  The  Hankel  transform  of  a circle  is  given  by  the  Bessel  function  of 
zero  order.  The  behaviour  of  the  function  has  been  shown  in  Fig  (2A-1).  The 
argument  of  the  Bessel  function  is  (aw)  and  depends  on  the  radius  of  the  circle. 
The  transform  is  circularly  symmetric. 


(b)  A disk  of  radius  a has  been  acted  on  by  the  Hankel  transform.  The  re-- 
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FIG  (2B-3)  Candidate  functions  and  their 
Hankel/Fourier  Transform 


suit  is  the  Bessel  function  of  first  order  divided  by  its  argument  (X)  is 
shown  in  Fig  (2A-l) . The  transformed  function  possess  circular  symmetry. 

(c)  and  (d)t  In  (c)  the  Fourier  transform  of  two  circles  and  in  (d)  the 
Fourier  transform  of  four  circles  with  circular  symmetry  has  been  shown.  Since 
the  function  in  (c)  and  (d)  are  functions  of  two  variables  r and  Q,  their  trans- 
forms won't  be  circularly  symmetric.  The  Fourier  transform  of  the  two  circles 
in  (c)  is  the  multiplication  of  the  Fourier  transform  of  one  circle  (a)  by  a 
cosine  function  which  varies  along  the  common  axis  of  the  two  circles.  Notice 
that  this  behaviour  is  analagous  to  the  behaviour  of  two  parallel  lines  in 
Fig  (2B-2).  In  (d)  we  see  that  the  transformed  function  of  the  circle  in  (a) 
has  been  multiplied  by  two  cosine  functions.  Each  of  these  two  cosine  functions 
varies  along  the  common  axis  of  those  two  circles  whose  centers  are  symmetric 
relative  to  the  origin.  It  is  easy  to  show  that  if  the  common  axis  of  two  cir- 
cles makes  an  angle  0 with  the  x axis,  then  the  argument  of  the  cosine  function 

will  appear  as  Ru  Cos  9 in  which  R is  the  distance  between  centers  of  the  two 
2 

circles. 

EXAMPLES  FOR  MATCHED  FILTERING 


In  matched  filtering  method,  it  is  not  necessary  to  select  any  specific 
features  in  order  to  show  the  power  of  the  technique.  Since  a specific  matched 
filter  can  be  built  to  detect  any  object  image  in  a picture,  any  example  will 
be  as  useful  as  any  other.  In  Fig  (2B-4),  some  examples  have  been  shown.  Part 
(a)  of  the  figure  shows  some  object  images  to  be  detected.  In  part  (b)  the  co- 
rresponding matched  filter  transfer  function  has  been  shown  and  part  (c)  shows 
the  corresponding  outputs. 

(50) 


In  Fig  (2B-5)  the  detection  of  multiple  taxgets  by  the  same  matched  filter 


has  been  shown  . As  it  was  mentioned  in  section  2A,  this  property  is  based  on 
the  fact  that  the  convolution  of  the  input  signal  with  the  matched  filter  res- 
ponse searches  the  entire  image  plane  for  a possible  match.  Fig.(2B-6)  shows 
some  problems  associated  with  the  normal  matched  filter,  (a)  shows  our  matched 
filter  impulse  response  which  is  kept  fixed  for  different  inputs.  This  matched 
filter  has  been  designed  to  detect  the  target  shown  in  (b)  and  the  correspond- 
ing  output  peak  would  be  of  the  value  C 17  A . In  (c),  (d)  and  (e)  we  have  shown 
some  other  inputs  which  when  applied  to  the  matched  filter  in  (a),  will  give  the 
same  peak  at  the  output.  This  in  turn  will  result  in  a " false  " detection.  In 
section  2A  we  showed  how  these  problems  can  be  avoided  by  either  some  kind  of 
normalization  of  the  input  image  or  by  using  a gradient  matched  filter. 

In  Fig  (2B-?) , we  have  shown  the  problem  that  may  arise,  when  a gradient 
matched  filter  is  rotated  in  discrete  steps  in  order  to  detect  a rotated  ver- 
sion of  a target.  The  matched  filter  may  never  overlap  the  target  as  it  was 
discussed  in  section  2A.  Fig.  (2B-8)  shows  how  the  above  problem  can  be  over- 
come by  using  a fuzzy  version  of  the  matched  filter  inpulse  response. 

'I 

CONCLUDING  REMARKS 


In  this  chapter,  the  analytical  properties  of  Fourier  and  Hankel  transforms 


and  matched  filtering  process  were  studied  and  some  examples  were  used  to  en- 
hance those  properties  . The  topic  of  extrapolation  techniques  was  only  touched 
briefly  and  it  was  suggested  how  that  can  be  used  in  bringing  out  hidden  parts 
of  a picture.  Our  emphasis  was  put  on  to  see  how  different  transformations  act 


on  different  patterns.  Moreover,  an  important  aspect  of  any  transformation  be- 
ing used  in  feature  extraction,  is  how  that  transform  behaves  when  the  input 


. V-_  TT—SfcS  V - ' 


U f(x,y)=c 


h(x,y)-f(-x,-y) 


(a)  matched  filter  input  /lx 

ibj  matched  filter  transfer  functi 


f (x,y)*h(x,y) 


(c)  matched  filter  output 

FIG(  2B- 5 Petection  of  multiple  targets  by  the  matched 

filter  in  (b) 


FIC(2B-  7)  The  rotated  versions  of  a matched  filter  designed  to  detect 
the  triangular  Image  shown  above.  Each  time  the  M.F.  Is 
rotated  by  the  amount  22.5,  and  since  It  never  completely 

I 

overlaps  the  target,  the  object  cannot  ue  detected. 

I 
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a fuzzy  version  of  the 
matched  rilter 
impulse  response 


The  drawback  of  the  matched  filter  in 1 1 G ( 2 B-  7 ^aa  been  overci 


Here  one  of  the  rotated  versions 


by  using  a fuzzy  version  of  it 
overlap  the  object  ismge  to  be  detected 


image  is  translated,  rotated  or  scaled.  This  subject  was  also  studied  and  the 
following  table  compares  different  transforms  in  the  above  sense. 


Transform 

Translation 

Invariant 

Rotation 

Invariant 

Size 

Invariant 

Fourier 

(Spectrum) 

O.K 

The  spec- 
trum ro- 
tates 

The  spec- 
trum scaled 

Hankel 

O.K 

O.K 

t • 

Matched 

O.K 

Gan  be  mo- 
dified to 
do  so 

Cam  be  modi- 
fied to  do  so 
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Cartographic  objects  have  different  characteristics  and  so  can  be  categor- 
ized according  to  some  specific  properties.  As  an  example,  roads  and  rivers, 
on  a cartographic  picture  look  like  a line.  Some  objects  may  look  like  circles, 
rectangular  patches  and  some  geometrical  shapes.  The  discrete  transforms  when 
operated  on  any  of  those  objects,  sometimes  give  some  particular  patterns  or 
salient  features  representative  of  the  original  objects  of  cartography  in 
transform  domain.  Since  the  geometry  and  the  object  are  usually  represented  in 
discrete  numbers  it  is  logical  to  attempt  to  resort  to  discrete  transform 
techniques  for  feature  extraction  of  these  objects.  Additionally,  discrete 
transforms  lend  themselves  readily  to  easy  implementation.  This  chapter  will 
concentrate  on  an  exposition  of  discrete  transforms  and  examples  which  may  have 
some  relevance  to  cartographic  analysis. 

To  process  a picture  f(x,y),  this  picture  is  usually  sampled  on  a grid 
of  M X N points,  each  number  in  the  array  representing  the  gray  level  in  the 
picture  at  the  corresponding  sampling  point.  Thus,  a picture  can  be  represen- 
ted by  a matrix,  [f],  the  elements  of  which  are  f(x,y). 

A general  discrete  transform  on  this  picture  is  given  by, 

M N 

F(u,v)  = £ £ f(x.y)  g(x,y,u,v) 

x = 0 y = 0 

F(u,v)  is  the  transform  with  u,v  taking  on  integer  values  from  M to  N respec- 
tively and  g(.)  is  the  Kernel  of  the  particular  transform.  The  inverse  trans- 
form if  it  exists  will  be, 


and  in  addition  is  symmetric  if  g , and  g„  are  functionally  equal 


The  same  argument  holds  for  the  inverse  kernel.  The  seperable  symmetric 
property  is  desirable  when  the  implementation  problem  comes  into  consideration 
Furthermore,  since  statistical  intensity  variations  of  most  images  an*  nearly 
the  same  in  the  vertical  and  horizontal  directions , only  seperable  symmetric 
kernels  usually  need  to  be  considered,  of  course,  as  we  will  see  later,  some 
^r^isforms  like  Slant  transform  are  not  symmetric  and  they  are  usually  used  to 
process  some  special  patterns. 


It  is  often  useful  to  express  two-dimensional  transforms  in  matrix  form. 
For  simplicity  let  us  take  M = N,  so  that  we  will  have  a square  array.  Then  for 
a transform  kernel  that  is  seperable  symmetric  let, 


[f]  = image  matrix 

[F]  = transformed  image  matrix 

[A]  = transform  matrix 


then  by  matrix  multiplication 


[F]  = [A][f][A] 

It  can  be  easily  shown  that  the  inverse  transform  is  given  by, 

[f]  = [A]-*[F][A]-1 

Thus  f(x,y)  and  F(u,v)  can  be  expressed  as  a two-dimensional  transform  if  [a] 
has  an  inverse 

If  [A]  is  a symmetric  orthogonal  matrix  then, 

[A]_1  = [A] 

If  a transform  kernel  is  seperable  but  not  symmetric,  then  we  have, 

[F]  = [A][f][A]T 

On  the  other  hand  if  it  is  to  be  a one  dimensional  transform  then  [F]  = [A][f] 
and  the  inverse  transform  is  [f]  = [A]**  [F]  if  [A]  is  not  symmetric  or  else 
[f]  = [A]  [F]  if  [A]  is  symmetric. 


One  can  easily  indentify  l/N  Wn  , l/N  , l/N  Sr  and  Dr  as  the  Kernel  A 


of  N X N dimension  in  the  case  of  Walsh  Hadamard,  Haar,  Slant  and  Discrete 
Cosine  transform  in  the  following  pages. The  image  matrix  [f]  is  represented  as 
a vector  f„  where 

fN  = £f0 

pixel  system  and  as 

Poo  f0i  f0N  -1 

fll  fl  N -1 

fM-ll  fM-l  N-l 

for  a two  dimensional  data  pixel  system. 

To  make  this  chapter  self  contained,  in  section  (A)  of  this  chapter,  a 
general  review  of  the  two-dimensional  discrete  transforms  is  discussed  along 
with  a brief  summary  of  their  properties.  The  Walsh  Hadamard  transforms  (WHT), 
the  Haar  transforms  (HT)  the  Slant  transforms  are  covered  in  this  chapter. 

The  discrete  transforms  seem  to  have  a decided  advantage  over  the  continuous 
transforms  from  the  point  of  view  of  the  Hardware/Software  implementation 
with  a Microprocessor  based  signal  processing  system  . In  section  B of  this 
chapter,  the  two-dimensional  transforms  of  the  candidate  features  of  the  aerial 
photographic  patterns,  are  compared  from  the  point  of  view  of  the  characteris- 
tic properties  of  feature  extraction. 


f = 
MN 


k10 


M-10 


f^  f^  ^ j ] for  a one  dimensional  data 

*MN  Where 


3A>  THEORY 


TWO-DIMENSIONAL  WHT  (l)t 

Walsh-Hadamard  transforms  are  used  for  the  Walsh  representation  of  data 
sequences.  Their  basis  functions  sure  sampled  Walsh  functions  which  can  be 
expressed  in  terms  of  the  Hadamard  matrices  Wfi  . These  matrices  can  be  gene- 
rated vising  the  following  recurrence  relation 


Wk  = 


’k  - 1 


'k  - 1 


'k  - 1 


-W 


k - 1 


k = 1,2,  . . . n 


(3A-1) 


where  Wq  = 1 and  n = log2N. 


For  example,  with  k = 1,  and  k = 2,  Eq.  (3A-l)  yields 


The  order  of  a Hadamard  matrix  need  not  be  a power  of  two.  Existence  of 
the  Hadamard  transforms  for  values  of  N other  than  integer  power  of  two  has 
been  shown  up  to  N = 200.  However,  most  of  the  applications  of  transforms  in 
image  processing  are  based  on  N = 2n  samples  per  row  or  column  of  an  image. 


4 


A frequency  interpretation  can  be  given  to  the  Hadamard  matrix  generated 


by  the  core  matrix  of  Bq.  (3A-l).  Along  each  row  of  the  Hadamard  matrix  the 
frequency  is  called  the  number  of  changes  in  sign.  The  word  " sequency  " is 
primarily  used  to  designate  the  number  of  sign  changes.  It  is  possible  to  con- 
struct a Hadamard  matrix  of  N = 2n  that  has  sequency  components  at  every  in- 
teger from  0 to  N - 1,  Fig.  (3A-1)  shows  the  Hadamard  matrix  of  order  8 along 
with  the  sequency  interpretation. 

MATRIX 

+ + + + + + + + 

+ -+  -+  -+  - 

+ + --  + + -- 
+ - - + + - - + 

+ + + + --  -- 

+ - + - _+  + 

+ + ----  + + 

+ --  +-  + + - 

Fig.  (3A-1)  i Hadamard  matrix  of  order  8 

A very  closely  related  transform  to  the  Hadamard  transform  is  the  Walsh 
transform.  For  N = 2n  , the  Walsh  transformation  matrix  is  found  by  inter- 
changing the  orders  of  the  rows  and  columns  of  the  corresponding  Hadamard 
matrix.  Since  this  is  the  only  difference  between  Walsh  and  Hadamard  matrices 
when  N = 2n  , the  use  of  the  Walsh  and  Hadamard  transforms  is  intermixed  in 
the  picture  processing  literature  and  hence  the  term  Walsh-Hadamard  transform 
is  used  to  denote  either  of  them. 
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SEQUENCY 

0 

7 

3 

4 
1 
6 
2 

5 


lc  Ic 

It  is  straightforward  to  show  that  the  matrices  of  2 rows  and  2 columns 
have  the  following  properties i 


(a)  is  a symmetric  matrix,  i.e. 


Wk  = "k 


0-2) 


prime  denoting  transpose. 


(b)  W.  is  orthogonal,  i.e. 


Wk  X "k  = 2 *k 


where  I.  is  the  (2^  X 2^)  identity  matrix. 


(c)  The  inverse  of  W.  is  proportional  to  itself,  i.e. 


-1  -k 

if  = 2 W 

"k  * "k 


where  is  the  inverse  of  W. 


Let  fN  denote  an  N-periodic  Data  sequence, 


= [f(0)  f(l)  f(N  - 1) 


( 3A.-3) 


where 


N = 2 


(3A-4) 


0-5) 


0-6) 


I 
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The  (WHT)  of  f is  defined  as 


F„  = ~tr~  V f„ 
N N n N 


(3A-7) 


Define  F(k)  as  the  kth  WHT  coefficient  and  let  F^  = [F(0),  F(l)....  F(N  - 1)]. 


From  (3A-4)  and  (3A-7)  it  follows  that  the  inverse  (WHT)  transform  (IWHT)  is 
defined  as 


fN  * WnFN 


0-8) 


Since  (3A-7)  and  (3A-8)  constitute  a transform  pair,  the  (WHT)  representation 
of  fn  is  unique.  The  two-dimensional  (WHT)  and  its  inverse  can  be  equivalently 
defined  in  matrix  form  as  follows j (the  details  are  covered  in  the  interim 
report  4) . 


For  a data  sequence  f of  M rows  N columns 


_ _1_ 


W_  f„„  W_ 


MN  MN  m MN  n 


and 


0-9) 


fMN  ~ Wm  FMN  WN 


where  M = 2m  and  N = 2n 


(69) 


1*. 


or  F, 


MN  = 


F(0,  0) 

F(l.  0) 


f(m  - 1,0) 


F(0,  1) 

F(l.  1) 


F(M  - 1,1) 


where  F(i,j)  is  the  (i,j)th  transform  coefficient. 


PROPERTIES  OF  WHTs 


F(  0,  N - 1) 
F(l,  N - 1) 


F(H  - 1,  N - 1) 


(l)  The  sequency  spectrum  of  a function  is  a measure  of  the  correlation  of  this 
function  with  the  Walsh  basic-patterns.  These  basic-patterns  were  used  to  ana- 
lyze the  Walsh  spectra  of  one  and  two-dimensional,  figures  and  relations  between 
image  samples  and  Walsh  spectrum  coefficients  were  established  [2]  (see  the 
Fig.  3A-2,  3A-3,  3A-4). 
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Fig.  (JA-2)  Sequency  Spectra  of  Walsh  Fig.  (3A-3)  One-Dimensional  " Basic 


Function 


SEQUENCES  — 0 


Fig.  (>-4)  Walsh  Basic  Patterns.  Basic  Pattern  (i,  j) 
is  Located  at  the  Intersection  of  Row  i 
and  Column  j.  -l's  have  been  Blanked 
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(a)  REFLECTION  (OR  IMAGE)  SYMMETRY 


Walsh  basic  patterns  (i, j)  with  even  j's  possess  reflection  symmetry 
about  y axis.  When  the  frame  is  folded  about  its  central  vertical  axis  a,,,  the 
right  half  of  the  pattern  matches  the  left  half  exactly.  Basic  patterns  with 
even  i's  possess  reflection  symmetry  about  x axis.  Since  every  sequency  coeffi- 
cient represents  the  correlation  of  the  input  pattern  with  a Walsh  basic- 
pattern,  a figure  with  reflection  symmetry  about  y axis  has  a sequency  spectrum 

whose  coefficients  b are  zero  for  all  odd  n.  For  a figure  with  reflection 

mn 

symmetry  about  x axis  the  coefficients  are  zero  for  all  odd  m.  For  a figure 
with  both  x and  y reflection  symmetries,  the  coefficients  b^  are  zero  for  all 
odd  m and  n. 

(b)  PERIODICITY 


Periodicity  about  y axis  corresponds  to  axis-symmetry  only  about  vertical 

axis  a2  . Walsh  basic-patterns  (1, j)  with  j = 0,3, 4, 7 are  periodic  about  y axis. 

If  a general  binary  two-dimensional  pattern  is  periodic  about  y axis,  the 

coefficients  b of  its  sequency  spectrum  are  zero  for  n = 1,2, 5, 6 and  for 
mn 

patterns  periodic  about  x axis  the  coefficients  are  zero  for  m = 1 ,2, 5,6 , when 
a pattern  is  periodic  about  both  x and  y axes,  coefficients  b^  with  m,n  = 

1.2, 5,6. 


(c)  REFLECTION  SYMMETRY  ABOUT  MAIN  DIAGONAL 


Since  basic-patterns  (i, j)  with  i = j are  symmetric  about  their  main  dia- 
gonal, the  diagonal  coefficients  of  a spectrum  give  a measure  of  unique  axis- 


I 


symmetries  and  reflection  symmetries  about  the  main  diagonal  of  a figure. 


fd)  GENERAL  SYMMETRY 


A pattern  possesses  general  symmetry  when  it  is  symmetric  about  both  x 
and  y axes  and  about  its  main  diagonal.  When  a symmetric  figure  is  centered  in 
a frame  so  that  the  x and  y axes  of  the  figure  are  the  x and  y axes  of  the 
frame,  it  yields  a sequency  spectrum  which  is  also  symmetric.  The  transpose  of 
the  spectrum  is  the  spectrum  itself  except  for  a scale  factor.  If  the  image  is 
markedly  unsyrametric,  the  transform  matrix  will  be  too. 


(e)  If  all  the  elements  of  the  image  are  equal  and  nonzero,  then  the  sequency 
spectrum  has  only  one  nonzero  term,  b^^  , Conversely,  a single  nonzero  results 
in  a sequency  spectrum  having  constant  magnitude.  Symmetry  features  may,  there- 
fore, be  extracted  from  a pattern  by  operating  on  its  Walsh  sequency  spectrum. 

(2)  Fast  WHT  algorithms  involving  Nlog2N  additions  and  subtractions,  do  exist. 
For  details  see  the  interim  report  # k. 

(3)  Let  the  data  sequence  fN  = [ fQ  f , fN  _ ^ be  subjected  to 

dyadic  shift  and  the  new  shifted  data  sequence  be  designated  as  f , = 

—d 

lo  -1  In  _ • Where  fk  = fj,(k  + s),  k = 0,1  N - 1 and  + 

denotes  modulo  2 addition.  The  following  table  illustrates  the  modulo  2 addi- 
tion for  N = 8 

Making  use  of  the  above  table  one  can  evaluate  f for  k = 0,1,  7. 


For  example 


—1  = *2  ^5  ^4  ^7  f^] 

. (73) 
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It  is  easy  to  show  that  the  dyadicaUy  shifted  WHTs  FN  are  related  to  in  the 
following  manner  : 


£»  - "hete  PN  - [po  Fl 

sional  WHTs. 


Fn  a set  of  one  dimen- 


Theref ore  the  WHTs  are  dyadic  invariant . For  details  see  ( 1 ) . 


(4)  Since  the  WHTs  are  dyadic  invariant,  one  can  develop  WHT  power  spectrum 
analogous  to  the  Fourier  representation.  For  example  when  N «=  8 the  power  spec- 
trum P for  n = 0,1  N/2  is  given  by 

n 


P0  = ^ 


pi  = + i 


(74) 
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Thus  the  general  form  of  the  above  spectrum  is  as  : 

P0  ■ *0 

Pk  = F2(2k  - 1)  + F2(2k)  k = 1,2  N/2  - 1 

P„/2  = 


Where  Pn  is  the  WHT  power  spectral  point  associated  with  the  order  of  sequency  n. 
Thus  this  spectrum  has  (n/2  + l)  points. 

The  WHT  phase  spectrum  is  defined  as 


*0  - o.n 

^N/2  = 2m  n + n/2  m = 0,1,2 

and 


Since  the  power  spectrum  is  dyadic  invariant  one  can  define  a correlation. 
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If 

N - 1 

!/"  E fk  «k 

h = 0 

^ h 

where 

k = 0.1 

. . N - 1 

then 

J?s> 

II 

Fk  where  Gk 

is  the  WHT  for  the  sequence  gfc  h 

k = 0, 

1 y • • • • 

Similarly  the  auto  correlation  function  is  defined  as  t 

N - 1 

If  fk±  l/N  E fk  fk  © h 


The  convolution  is  defined  as  i 

N - 1 

If  fk  = i/»  E 'k  «k  q h 

h = 0 

k = 0,1  N - 1 where  o denotes  the  modulo  2 subtraction. 

F,  = F.  G.  k = 0,1,  N - 1 

k k k 

Since  the  modulo  2 addition  and  subtraction  are  identical,  there  is  no  differ- 
ence between  dyadic  correlation  and  convolution.  Hence  the  linear  filtering  and 
matched  filtering  techniques  can  be  applied  in  the  domain  of  Walsh  Hadamard 
functions  for  signal  processing.  For  details  see  (4)  * (5)« 
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TWO-DIMENSIONAL  HAAR  TRANSFORM  (HT)  (l)  » 

The  Haar  transform  (HT)  coefficients  F(k),  k = 0,1,  ...,  N - 1 correspon- 
ding to  a data  sequence  f = [f(0)  f(l)  ......  f(N  - l)],  axe  obtained  by 

computing  the  transformation 

PN  = IT  Hn  fN  (3A-10) 

where  N = 2n 


where  Hn  is  the  (N  X N)  Haar  matrix.  is  obtained  by  sampling  the  set  of 
Haar  functions  [har(r,m,t)]  defined  in  Eq.  (3A-lla).  For  example,  the  (8  X 8) 
Haar  matrix  is  given  by  Eq.  (3A-llb) . 


har( 0 , 0 , t) 


har(r,  m,  t) 


t*.  [o,o 


< t > 


m - l/2 
2r 


(3A-lla) 


elsewhere  for  t « [0,1) 


where  0 ^ r < log^N  and  1 ^ m ^ 2r  . 


Sampling  of  the  set  of  Haar  functions  in  (3A~5a)  results  in  the  array  shown  in 
Fig.  (3A-5),  each  row  of  which  is  a discrete  Haar  function  Haar(r,m,t).  Arrays 
so  obtained  are  used  in  connection  with  the  Haar  transform  and  may  be  denoted 
by  Hn,  where  n = log2N. 

(77) 


tfx> 


Examining  H^  we  observe  that  N/2  coefficients  in  the  Haar  domain  measure  the 
adjacent  correlation  of  coordinates  in  the  data  space  taken  two  at  a time,  N/4 
measure  corrdinates  taken  4 at  a time,  etc.  up  to  N/N  coefficients  measuring 
all  the  N coordinates  of  the  data  space. 


The  two-dimensional  Haar  transform  of  a data  sequence  f^  of  M rows  and  N 
columns  is  : 


l/MN  Hm  fMN  HN 


H'  F„„  H where  M = 2m  and  N = 2n 
n MN  n 


( 3A-12) 


Ac  in  the  case  of  VfHT  transforms  F^  can  be  computed  using  the  one-dimensional 
Haar  transform,  a total  of  MN  times  as  i 


' MN  = 


F(0,  0)  F(0,  1) 

F(  1 , 0)  f( 1 , 1) 


F(M  - 1,  0)  F(M  - 1,  1) 


F( 0,  N - 1) 
F(l,  N - 1) 


F(m  - 1,  N - l) 


where  F(i,j)  is  the  (i  , j)th  transform  coefficient. 


(3A-13) 


PROPERTIES  OF  THE  HTr 


(l)  The  HT  provides  a domain  that  is  both  locally  sensitive  as  well  as  glo- 
bally sensitive.  In  the  case  of  the  discrete  Fourier  and  Walsh-Hadamard  trans- 
forms, each  transform  coefficient  is  a function  of  all  coordinates  in  the  ori- 


i 


(79) 


i 


. 


ginal  data  space  (global) , where  as  this  is  true  only  for  the  first  two 
Haar  coefficients. 

(2)  The  Cooley -Tukey  type  of  algorithm  to  compute  Fast  HT  requires  loggN  bit 
reversals  and  is  implementable  in  2(N  - l)  additions  and  subtractions  and 
N multiplications.  For  details  see  the  interim  report  # 4. 

(3)  The  HTs  are  dyadic  invariant  and  hence  possible  to  build  a matched  filter 
and  not  invariant  to  size  and  translation. 

TWO-DIMENSIONAL  SIANT  TRANSFORM  (l)  : 

A desirable  property  for  an  image  coding  transform  is  that  the  transform 
compacts  the  image  energy  to  as  few  of  the  transform  domain  samples  as  possible. 
Qualitatively  speaking,  an  efficient  energy  compaction  will  result  if  the  basis 
vectors  of  the  transformation  matrix  " resemble  " typical  horizontal  or  ver- 
tical lines  of  an  image.  In  the  case  of  a typical  monochrome  image,  a large 
number  of  the  lines  are  of  nearly  constant  gray  level  over  a considerable 
length.  The  Fourier,  Hadamard,  and  Haar  transforms  possess  a constant  valued 
basis  vector  that  provides  an  efficient  representation  for  constant  gray  level 
image  lines,  while  the  Karhunen-Loeve  transform  has  a nearly  constant  basis 
vector  suitable  for  this  representation.  Another  type  of  typical  image  line  is 
the  line  that  linearly  increases  or  decreases  in  brightness  over  the  length. 

None  of  the  data  transforms  previously  mentioned  possess  a basis  vector  that 
efficiently  represents  such  image  lines. 


4 


Orthogonal  transformations  containing  a " Slant  " basis  vector  are  intro- 
duced. The  slant  vector  is  a discrete  sawtooth  waveform  decreasing  in  uniform 

(80) 


steps  over  its  length,  and  is  suitable  for  efficiently  representing  gradual 
brightness  changes  in  an  image  line.  As  illustrated  in  Fig.  (3A -6).  The  slant 


transform  is  based  on  the  slant  vector  which  is  an  orthogonal  matrix  defined  by 


Fig.  (3A-6)  A Slant  Vector  for  N = 4 and  step  size  of  2 Units 


Slant  Matrix  Construction  : lx’  S, 


The  slant  matrix  for  N = 4 can  be  written  as 


where  a and  b are  real  constants  to  be  determined  subject  to  the  following 


conditions  i 


(1)  Step  size  must  be  uniform,  and 

(2)  S2  must  be  orthogonal 


The  step  size  between  the  first  two  elements  of  the  slant  vector  [see  second 
row  of  S9]  is 


(a  + b)  - (a  - b)  = 2b, 


and  the  step  size  between  the  second  and  third  elements  is 


(a  - b)  - (-a  + b)  = 2a  - 2b 


(3A-16) 


(3A-1?) 


which  leads  to  a = 2b 


Hence  S„ 


(3A-18) 


Using  the  orthogonal  condition 


-V[3b  t - b - 3b]  4=-  [3b  b - b - 3b]’  = 1 


we  obtain 
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. 5,  * *.  Vmi 


Thus  the  slant  matrix  in  Eq.  (3A-20)  becomes 
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(3A-19) 


We  observe  that  Sg  possesses  the  sequency  property.  It  is  easily  seen  that  the 
sequencies  of  the  rows  of  are  0,1,  1 and  2,  which  equal  the  sequencies  of  the 
corresponding  rows  of  the  Walsh-Hadamard  matrix. 
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Now  So  can  be  expressed  in  terms  of  S. 


such  that 


0-2 


where  ~ - , and  b^  = 


Eq.  (3A-20)  can  be  generalized  to  give  the  slant  matrix  of  order  N/2  by  the 
construction  as  shown  in  the  Fig.  (3A-7) 
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Fig.(3A-7)  Comparison  of  VHT  and  ST  basis  Vectors  for  N = l6 
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Similarly  the  relation  between  S2  and  is  given  by  (l) 
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where  I~  denotes  the  (2X2)  Indent! ty  Matrix 
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Where  a and  b axe  constants.  In  S-  the  slant  vector  is  obtained  by  a simple 
n n j 

scaling  operation  of  S2,  while  the  remaining  terms  serve  to  obtain  the  sequency 
and  orthogonal  properties. 


The  coefficients  (a^  , b^^)  can  be  computed  using  the  following  relations 


a2  = 1 


bN  = 


^ = 


(1  ♦ 4a*  ) ^ 


2Vn/2 


N = 4,8,16, , 


(3A-22) 


The  two-dimensional  Slant  transform  of  f ^ data  of  M rows  and  N column  is  i 


MN  = Sm  f S' 
m MN  n 


Sm  FMN  Sn 


(3A-23) 


where  M = 2m  and  N = 2n 


As  is  the  case  with  WHT  transforms  the  two-dimensional  transforms  and  the  in- 
verse can  be  computed  using  the  one  dimensional  transform,  a total  of  M X N 
times  as  t 


4 


F(0,  0)  F(0,  1)  F( 0,  N - 1) 

F(i,  o)  F(i,  1)  F(l,  N - 1) 

F(M  - 1,  0)  F(M  - 1,  1)  F(M  - 1,  N - 1) 


where  F(i,j)  is  the  (i,j)th  transform  coefficient. 


PROPERTIES  OF  STs 

(1)  The  spatial  redundancy  of  color  images  and  the  limitations  of  human  color 
vision  can  be  exploited  to  achieve  band  width  reduction  for  color  image 
transmission  (3). 

(2)  For  detecting  the  gradual  brightness  changes  in  an  image  line,  the  STs 
axe  good. 


(3)  The  transforms  are  not  dyadic  invariant  and  hence  not  useful  for  matched 
filtering  techniques. 

(4)  Fast  computational  capability  is  poor  and  so  far  in  the  literature  avai- 
lable no  Fast  Slant  transform  algorithms  are  reported. 

DISCRETE  COSINE  TRANSFORMS  (DCT)  (l)  t 

1 

It  is  known  that  the  Fourier  series  representation  of  any  continuous  real 
and  symmetric  function  contains  only  real  coefficients  corresponding  to  the 

! 


(8?) 


cosine  terms  of  the  series.  This  result  can  be  extended  to  the  discrete  Fourier 


transform  of  images  under  proper  interpretation.  There  are  two  ways  in  which  an 
image  field  can  be  made  symmetric,  as  shown  in  Fig.  (3A.-8) . By  the  first  techni- 
que the  images  axe  folded  about  an  edge,  and  in  the  second  method  the  images 
are  folded  and  overlapped  by  one  pixel.  Hence  for  an  N X N pixel  image  the  Dis- 
crete Cosine  Transforms  can  be  defined  as  follows. 


Where  F(k)  is  the  k-th  DCT  coefficient. 

It  is  worthwhile  noting  that  the  set  of  basis  vector  elements 

1 

* 

is  actually  a class  of  discrete  Chebyshev  polynomials.  This  is  easily  seen  by 
examining  the  following  definition  of  Chebyshev  polynomials 


(88) 


where  T.  (Z  ) is 
k m 


Now,  the  zeros 


hzj]  k,  m = 1,2, 

the  kth  Ghebyshev  polynomial. 


of  the  Nth  polynomial  TN(Zm)  are  given  by 


cos[k  cos 


N - 1 

(>-26) 


Z 


m 


(2m  + l)n 

cos  

2N 


m = 0,  1 N - 1 0-27) 


Substituting  Eq.  (3A-27)  in  Eq.  (3A-26),  we  evaluate  [T^Z^],  1 = 0,1,....  N - 

at  the  zeros  of  T„(Z  ) . This  results  in  the  set  of  Chebyshev  polynomials 
N ra 


T„(») 


(34-28) 


and 


V"> 


(2m  + l)kn 
2N 


m = 0,  1, 


N - 1 


which  are  equivalent  to  the  basis  set  of  the  DOT. 

Again,  the  inverse  discrete  cosine  transform  (IDCT)  is  defined  as 


f(m)  = F(0)  + 

\N 


N - 1 

D F(k) 

k = 1 


(2m  + l)kH 

cos  

2N 


» 


m = 0,  1, 


N - 1 


0-29) 


It  can  be  shown  that  application  of  the  orthogonal  property 


1,  p = q 


(3A-30) 


(3* -33) 


where  N = 2m  and  N = 2n 

As  in  the  case  of  WHT,  the  two-dimensional  DCT  and  the  inverse  can  be  computed 
by  using  the  one  dimensional  transform  a total  of  M X N times  as 

F(0,  1)  F(0,  N - 1) 

F(l,  1)  F(l,  N - 1) 

F(M  - 1,  1)  F(M  - 1,  N - 1) 


F(0,  0) 

F(l,  0) 

F(M  - 1,  0) 


where  F(if j)  is  the  (i,  j)th  transform  coefficient. 


PROPERTIES  OF  DCTs 


(1)  The  DCTs  give  superior  performance  and  is  comparable  to  K - L transform  in 
• rate  distortion  * applications  while  maintaining  the  computational  sim- 
plicity of  a transform  which  does  not  depend  on  the  picture  statistics. 

(2)  By  simulation  one  can  show  that  the  DCT  is  equivalant  in  a mean  square 
sense  to  the  K - L Transform  under  basis  restriction. 

(3)  COMPUTATIONAL  CONSIDERATIONS.  It  can  be  shown  that  the  DCT  can  be  equiva- 
lently expressed  as 


Hk)  = -ff 


Re  < e 


2N  - 1 


2N  £ f(m)Wh 


m = 0 


• k = 1,2,  mm!  N - 1 


(>-34) 


where  W = e-12  Q /2N  , i = V“li  f(m)  = 0,  m = N,  N + 1 N - 1, 

Re  [.]  denotes  the  real  part  of  the  term  enclosed. 


From  Eq.  0-34)  it  follows  that  all  the  N coefficients  of  the  DCT  can  be  com- 
puted using  a 2N-point  FFT.  Similarly  it  can  be  shown  that  a 2N  - point  IFFT 
yields  all  the  IDCT  coefficients. 


The  fast  transform  algorithms  for  Walsh  Hadamard  transforms  and  Haar  transforms 
are  discussed  in  detail  in  the  interim  report  # 4.  The  fast  slant  transform 
algorithm  does  not  seem  to  exist  so  far.  The  following  table  shows  the  impor- 
tant properties  of  the  Discrete  transforms. 


Table  1:  Properties  of  the  transforms. 

A = Excellent  B = Good  G = Satisfactory  D = Fair 


Property 


Speed  of  computation 


Hardware  implementation  feasibility 

Dyadic  shift  Invariance 

Matched  filter 

Power  spectrum 

Phase  spectrum 

Edge  detection 


WHT  HT  ST 


The  candidate  features  will  be  shown  on  a N X N grid  and  will  be  given  by 
a N X N matrix.  Each  element  of  the  matrix  represents  the  corresponding  bright- 


ness intensity  at  that  point. 


A transformed  matrix  can  have  negative  elements  as  its  entries.  Since  here 
again  we  are  only  concerned  about  the  visual  examination  of  a transform,  the 
absolute  value  of  entries  will  be  taken. 


Since  the  result  of  discrete  transformations  is  given  by  a set  of  points, 
it  is  very  difficult  to  come  to  a conclusion  just  by  observing  one  sample.  How- 
ever, some  general  remarks  can  be  made  about  the  results.  After  evaluating  the 
two -Dimensional  Discrete  transforms  we  set  a threshold  so  that  only  the  entries 
of  greatest  absolute  value  are  preserved.  Table  2 gives  the  comparative  state- 
ment in  recommending  the  different  transform  techniques  in  extracting  differ- 
ent patterns. 


i:  EXAMPLES 


In  section  B of  chapter  two,  some  candidate  features  from  the  objects 
usually  present  in  photographs  were  selected.  We  used  those  features  to  show 
the  application  of  continuous  transforms.  In  this  section  we  go  through  the  same 
procedure.  The  same  candidate  features  will  be  used  and  they  axe  shown  in 
Fig.  (3B-1). 


EXAMPLES 


In  Fig.  (3B-2)  some  typical  examples  of  cartographic  features  and  the 
corresponding  pixel  representations  by  a N X N matrix  axe  shown. 


(a)  Horizontal  line 


ith  row 


(b)  Vertical  line 
-* ith  column 

i 1 


(c)  Horizontal  line  with  shades 


(d)  Vertical  line  with  shades 
ith  column 


(e)  A horizontal  road 


(f)  A vertical  road 


1 1 1 i 

1111 


1111 


Fig.  (3B-2a)  Cartographic  features 


(g)  A patch  of  land  with  different 
shades  in  the  Horizontal 


(h)  A patch  of  land  with 


direction . 


different  shades  in  the 
Vertical  direction 


(k)  Diagonal  line  @ 135° 


(1)  Diagonal  line  @ 45° 


(m)  Diagonal  line  with  shades  @ 135  (n)  Diagonal  line  with  shades  @ 45 


Fig.  (3B-2c)  Cartographic  features 


(s)  Centered  circle  (t)  Centered  circle  with  shades 


Fig.  (3B-2d)  Cartographic  features 
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f 


Vertical  line 
Horizontal/Vertical  road 

Horizontal/Vertical  patch  of  land  with  colors 
Diagonal  line  with  @ 45°/l35° 

Diagonal  line  with  shades  @ 45°/l35° 

Slant  edge  45°,/l35° 


Horizontal  edge 


A circular  enclosure  with  shades 


A = Excellent  B = Good  C = Satisfactory  D = Fair 


Table  2 t A detailed  recommendation  for  the  use  of  different  techniques  in 


For  details  see  the  computer  printout  in  chapter  4 
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A software  package  was  developed,  for  the  evaluation  of  two-dimensional 
WHTs,  HTs,  STs,  and  DCTs  in  processing  the  data  pixels.  The  entire  program  is 
written  in  Fortran  IV  WATFIV.  For  fast  compiling  process,  the  software  package 
can  be  run  in  Fortran  IV  H Level.  Right  now  the  two-dimensional  transform  is 
computed  by  the  brute  force  technique,  such  ass 

FMN  = Am  • fMN  * An 

where 

t The  two-dimensional  discrete  transform. 

MN 

A i The  Discrete  transform  matrix 

A ' t The  transpose  of  A 

F„.t  s The  two-dimensional  data  pixels 
MW 

The  Walsh  Hadamard,  Haar,  Slant  and  Discrete  Cosine  matrices  are  readily 

available  in  the  form  of  subroutines.  Whenever  the  F„„  has  to  be  evaluated  the 

MN 

corresponding  subroutine  is  called  for  and  the  multiplication  of  the  matrices 
is  carried  through. 
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* 

I 
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This  program  enables  one  to  calculate  all  the  four  Discrete  two-dimensional 
transforms,  given  the  data  pixels  in  the  discrete  form.  It  accepts  the  data  in 
the  following  formi 
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wants  to  extend  the  domain  to  256  X 256  pixels  of  data,  the  " DIMENSION  STATE- 
MENTS " have  to  be  changed  to  256  X 2 56,  instead  of  32  X 32. 


Change  M = 8 
1 


1 

S = SQRT(2,0) 

DO  5 II  = l.N) 

DO  5 J1  = l.N) 

F(I1,J1)  = 0,0 
5 CONTINUE 

DO  10  I = 1,N 
E(1,I)  = 1,0 
10  CONTINUE 

DO  20  J = 1 ,N1 
E(2,J)  = 1,0 
E(2, J+Nl)  = -1,0 
20  CONTINUE 

DO  30  K = 1,N4 
E(3.K)  = S 
E(3,K+N4)  = -S 

E(4,K+Nl)  = S 

E(4,K+N1+N4)  = -S 
30  CONTINUE 

IF  (M-2)  120,120,150 
150  DO  40  KK  = 1,N1 

KL  = (2*KK) -1 
KM  = 2*KK 
E(N1+KK,KL)  = 2,0 
E(N1+KK,KM)  = -2,0 
40  CONTINUE 

IF  (M-4) 120, 140, 140 
140  DO  50  K1  = 1,N5 


DO  70  LI  = 1,4 
E(4+L1,K1+(L1-1)*N4)  =2,0 
E(4+Ll,Kl+(Ll-l)*N4+N5)  = -2,0 
70  CONTINUE 

50  CONTINUE 

IF(M-5)120,125,125 

125  DO  80  L2  = 1,8 

DO  60  K2  = 1,N6 

E(8*-L2,K2+(L2-l)*N5)  - 2,C 
E(8fL2,K2+(L2-l)*N5*-N6)  = -2,0 
IF(M-6) 120, 126, 126 

126  DO  81  L3  = 1,16 

DO  61  K3  = 1,N7 

E(l6fL3,K>(L3-l)*N6)  = 2.0 
E(l6*-L3,K>(L3-1)*N6+N7)  = -2.0 
IF(M-7)120,127,127 

127  DO  82  L4  = 1,32 

DO  62  K4  = 1 ,N8 

E(32+L4,K4+(L4-l)*N7)  = 2.0 
E(32*L4,K4*(L4-1)*N7+N8)  = -2.0 

62  CONTINUE 

82  CONTINUE 

61  CONTINUE 

81  CONTINUE 

60  CONTINUE 

80  CONTINUE 

120  CONTINUE 
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CHAPTER  V 


HARDWARE  FEASIBILITY  STUDY  OF  SIGNAL  PROCESSING  SYSTEM 


FOR  DISCRETE  TRANSFORMS 


A signal  processing  system  for  implementing  Discrete  transform  functions 
consists  of  (l)  an  Imaging  system  for  the  pattern  to  be  identified  (2)  a Data 
base  management  system  (3)  a Two-dimensional  transform  processor  and  (4)  an 
Interactive  Raster  scan  display  system. 


In  this  chapter  we  propose  a Microprocessor  based  special  purpose  signal 
processor  for  use  in  cartographic  studies  which  makes  use  of  the  two-dimensional 
Hadamard/Haar/siant/and  Discrete  Cosine  transforms.  All  these  discrete  Trans- 
forms are  evaluated  for  an  N X N dimensional  array  where  N = 2n  for  n being  an 
integer  0. 


f 


. 


t 
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Some  of  the  factors  influencing  the  signal  processing  system  design  are 
(l)  speed  of  computations  (2)  Ease  of  implementation  by  hardware/software  in 
real  time  applications  (3)  cost  effictiveness.  In  the  cartographic  studies  of 
Aerial  Imagery,  the  speed  of  computation  is  a very  important  criterion,  for 
example  the  strategic  Bombers,  the  high  altitude  reconnaissance  planes etc . 
The  speed  of  computation  is  a function  of  the  number  of  pixels  chosen  for  imag- 
ing a pattern.  The  bigger  the  dimension  of  the  two-dimensional  pixel  array  the 
higher  the  time  of  computation.  One  reasonable  pixel  array  dimension  is 
256  X 256  . The  type  of  Microprocessor  chosen  for  realizing  the  signal  pro- 
cessor must  be  capable  of  high  speed  acquisition  of  Data,  high  speed  Memory 
addressing  capability,  hardware  built  in  capability  to  do  high  speed  Arithme- 
tic processing  such  as  Multiply,  Divide,  Add  and  Subtract  in  floating  point 
notation,  simple  instruction  set  for  communicating  with  peripherals  in  full 
duplex  operation  and  handling  high  volume  data  storage  and  retrieval. 


Only  a 16  bit  Microprocessor  can  meet  such  a set  of  demands  as  mentioned 
above.  The  most  recent  Intel's  8086,  Zilog's  Z 8000  and  Motorola's  MC  68000, 
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stand  in  the  order  of  superiority  of  performance.  In  this  chapter  we  propose  a 
system  based  on  Z-8000  Microprocessor  which  is  already  available  in  the  market 
even  though  we  strongly  feel  the  Motorola  MO  68000  Micro process  is  (not  yet  re-  - 
leased  in  the  market)  the  outstanding  one  for  this  purpose.  The  general  design 
approach  of  the  system  discussed  in  the  following  pages  can  as  well  be  imple- 
mented with  MC  68000  in  case  one  decides  to  proceed  at  a later  stage  for  high 
speed  and  high  volume  data  applications  for  the  same  or  larger  image  arrays  as 
the  need  arises. 

AN  IMAGING  SYSTEM  i 


The  system  under  discussion  is  for  Imaging  a pattern  into  256  X 256  pic- 
ture elements.  A special  purpose  256  X 256  Element  area  Image  sensor  (CCD)  is 
required  for  this  purpose.  With  the  present  day  technology  it  is  possible  to 
build  such  a device  from  the  standard  LSI  techniques.  The  sensor  is  a solid 
state  self  scanned  area  image  sensor  suitable  for  use  in  camera  type  applica- 
tions. The  general  schematic  is  shown  in  the  fig.  (5~l). 

Light  energy  incident  on  the  image  sensor  elements  generates  a packet  of 
electrons  at  each  sensing  element.  Electrical  clocking  of  the  photogate,  the 
vertical  analog  transport  registers,  and  the  horizontal  analog  output  register 
sequentially  delivers  the  charge  packets  to  the  preamplifier.  The  charge  packets 
from  the  horizontal  register  are  sensed  by  a floating  gate  whose  potential 
changes  linearly  with  the  quantity  of  signal  charge  and  which  drives  a first 
M0S  transistor.  The  output  signal  from  the  transistor  in  turn  drives  the  gate 
of  an  output  n-channel  MOS  transistor  which  produces  the  video  output  signal  at 


terminal  VIDEO  OUT. 


BLOCK  DIAGRAM 


Fig. (5~l)  Image  sensor  (Courtesy  of  Fairchild) 


The  fig.  (5-2)  shows  the  block  diagram  of  am  Imaging  system.  The  CCD  sensor 
is  driven  by  clock  drivers  ordered  by  logic  generator  for  sequential  scanning 
purposes.  The  output  of  this  CCD  sensor  is  properly  processed  by  the  clamper 
and  the  video  Amplifier  combination.  Thus  the  Imaging  system  output  is  a video 
output  corresponding  to  the  pattern  scanned  sequentially . 


DATA  BASE  MANAGEMENT 


Each  analog  pulse  is  the  output  of  a particular  detector  during  one  scan 


of  the  array.  The  desired  signals  contained  in  the  intensity  of  the  light  are 
now  contained  in  the  photodetector  outputs.  Normally,  these  output  signals  are 
contaminated  with  a fluctuating,  as  well  as  a fixed,  pattern  noise. 


The  output  of  the  CCD  Sensor  Floating  gate  Amplifier  is  a stream  of  seve- 
ral analog  pulses  integrated  over  several  scans.  Hence  the  output  of  each  row 
of  the  horizontal  transport  register  has  to  be  processed  for  statistical  average 
before  jumping  to  the  next  row  output  of  the  horizontal  transport  register. 


Fig»(5_2)  Imaging  set  up 


I 
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UNCORRELATED  NOISE  REDUCTION  THROUGH  SIGNAL  AVERAGING  (l) 


The  first  type  of  fluctuating  photodetector  noise  to  be  minimized  is  the 
inevitable  thermal  noise  that  arises  in  all  semiconductor  devices.  This  noise 
contributes  to  random  fluctuations  about  the  average  value  of  each  array  photo- 
detector output.  Referred  to  as  uncorrelated,  the  noise  on  one  detector  is  un- 
related with  the  fluctuating  noise  on  any  other  detector.  In  some  optical  pro- 
cessing systems,  it  is  necessary  to  detect  changes  in  the  average  output  sig- 
nal that  are  smaller  than  the  fluctuations  about  the  average.  A straightfor- 
ward method  to  maintain  the  agerage  while  minimizing  the  fluctuations  is  through 
signal  averaging.  It  is  logical  to  apply  central  limit  theorm  to  derive  a set  of 
statistical  averages  of  the  signals  as  in  our  case  as  shown  below. 

It  can  be  shown  when  n samples  from  the  same  distributions  are  averaged 

then 


where  °"o  = Standard  deviation  of  signal  averager  output. 
( t ^ = Standard  deviation  of  input  samples, 
a^  = Average  value  of  signal  averager  output 
a^  = Average  value  of  input  samples. 


(5-1) 


Values  and  can  also  represent  the  rms  noise  levels  of  the  output 
and  input,  respectively,  of  the  signal  averager.  Note  from  Eqs.  (5-1)  and  (5-2) 
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that  the  average  of  the  output  values  from  a signal  averager  is  equal  to  the 
input  average,  while  the  variations  about  the  output  average  ( o'q)  is  less  than 
the  variations  about  the  input  average  ( °^).  The  result  of  signal  averaging  is 
that  the  desired  signal  (sq)  is  preserved,  while  the  undesired  signal  ( cr^)  de- 
creases. Pig.  (5-3)  shows  the  input  and  output  distributions  of  a signal  aver- 
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Flg*(5~3)  Input  and  output  distributions  of  a signal  averager 
(Courtesy  of  Computer  Design,  July,  1978) 


ager  with  n = 16. 


Implementation  of  signal  averaging  is  simplified  with  the  general-purpose 
laboratory  signal  processor  making  use  of  the  new  Z8000  microprocessor.  An 
array  of  24-bit  locations  is  established  and  initialized  to  0 as  data  accmula- 
tes  in  memory.  One  data  accumulator  must  be  reserved  for  each  detector  in  the 
photodetector  array.  An  equal  number  of  24-bit  locations  (input  block)  is  re- 
served in  memory,  which  will  contain  the  input  values  obtained  via  the  A-D  con- 
verter each  time  the  photodetector  array  is  scanned.  After  each  array  scan, 
the  content  of  each  location  in  the  input  block  is  added  to  the  content  of  its 
corresponding  data  accumulator.  A register  must  be  established  and  initialized 
to  count  the  number  of  sums  taken  in  the  averaging  algorithm.  This  counter  re- 
gister can  be  one  of  the  24-bit  registers  internal  to  the  GPU  or  a memory  loca- 
tion, assuming  that  not  more  than  255  sums  are  taken.  This  counter  register  is 
initialized  at  the  beginning  of  the  program  decremented  by  1,  and  checked  for  0 
each  time  the  data  accumulators  are  updated.  When  the  register  content  reaches  0, 
the  summing  operation  is  completed.  The  CPU  must  then  divide  the  content  of 
each  data  accumulator  by  the  number  of  sums  taken  to  arrive  at  the  final  aver- 
age. 


The  new  Z8000  has  integral  hardware  multiply  and  divide  facility.  Thus  a 
software  algotithm  which  was  earlier  required  to  carry  out  this  operation,  is 
replaced  by  a single  hardware  instruction.  In  signal  averaging,  the  GPU  can 
easily  access  to  the  data  with  more  ease  than  an  8-bit  machine. 

Note  that  the  signal  averaging  technique  will  minimize  the  effect  of  ther- 
mal noise  from  a photodetector  only  if  the  standard  deviation  of  the  detector 
noise  is  greater  than  the  rms  of  the  quantizing  noise  from  the  A-D  converter. 


If  the  least  significant  bit  of  the  A-D  converter  represents  a voltage  level 
of  K volts,  then  the  rms  of  its  quantising  noise  is  K/  \/Tz.  Therefore  signal 
averaging  will  minimize  any  random  analog  noise  if  the  standard  deviation  of 
the  noise  signal  is  equal  to  or  greater  than  K volts. 

CORRELATED  DETECTOR  NOISE  CANCELLATION 

In  optical  processing  systems,  there  is  a noise  type  that  is  correlated 
among  photodetector  outputs  because  it  causes  output  signals  of  all  photodetec- 
tors to  either  increase  or  decrease  simultaneously.  Sources  of  this  noise  are 
fluctuations  in  the  intensity  of  the  light  or  laser  source  and  in  the  photo- 
detector array  power  supply.  A simple  method  used  to  minimize  this  noise  is  to 
establish  a reference  photodetector  that  monitors  only  the  unmodulated  laser 
intensity.  The  input  signal  to  the  optical  processing  device  can  be  limited  in 
bandwidth  so  that  the  first  photodetector  of  the  array  receives  only  unmodula- 
ted laser  light.  This  photodetector  can  be  considered  the  reference  detector 
because  large  changes  in  its  output  will  be  due  only  to  laser  light  or  array 
power  supply  fluctuations. 

After  each  scan  of  the  array,  the  value  from  the  reference  detector  is 
compared  to  the  reference  value  of  the  previous  scan.  With  no  correlated  noise 
present  between  array  scans,  the  present  reference  value  will  be  equal  to  the 
previous  reference  value.  If  correlated  noise  is  present  between  scans,  the  re- 
ference signals  will  not  be  equal,  and  the  present  values  in  the  input  block 
of  data  must  be  modified  to  eliminate  the  effect  of  the  correlated  noise.  This 
modification  is  easily  implemented  in  the  signal  processor  either  before  or 
after  any  signal  averaging  by  using  the  flowchart  of  fig.  (5-4). 
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store  *P 


Fig.  (5-4)  Functional  flowchart  for  correlated  detector  noise 
cancellation  algorithm, (Courtesy  of  Computer  Design,  July,  1978) 


A signal  detector  of  a photodetector  array  can  be  used  to  obtain  reference 
detector  signal  (R)  during  scan.  is  reference  detector  signal  from  previous 
scan.  Addition  and  subtraction  operations  are  easily  implemented  because  Z8000 
CPU  uses  2' s complement  arithmetic. 
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STORE  INPUT  BLOCK  IN  MEMORY 


INPUT  REFERENCE  DETECTOR  SIGNAL  R 


SUBTRACT  |R  R,| 
FROM  EACH  ELEMENT 
OF  INPUT  BLOCK 


ADD  Jt-Rpl 
TO  EACH  ELEMENT 
OF  INPUT  BLOCK 


CALCULATION  OF  STATISTICAL  VARIANCE  USING  ARITHMETIC 


In  some  optical  processing  systems,  it  is  necessary  to  measure  the  fluc- 
tuating component  of  single  photodetector  output  quantitatively.  This  requires 

2 

the  calculation  of  the  statistical  variance  (S  ) of  n output  signals,  where 


v £ <xi  - * 


i = 1 


(5-3) 


E (x.) 


(5-4) 


i = 1 


In  Eq.  (5-3) , is  the  ith  output  sample  and  X is  the  average  of  n output 
samples.  For  a l6-bit  microcomputer,  it  is  convenient  to  let  n = 256  in 
Eq.  (5-3)  and  (5-4).  This  allows  division  by  n = 256  in  both  equations  to  be 
realized  by  truncating  the  l6-bit  sums  to  their  most  significant  eight  bits. 
After  receiving  236  detector  samples,  the  microcomputer  can  use  the  signal 
averaging  technique  to  calculate  X in  Eq.  (5-4).  Then,  X is  used  in  Eq.  (5-3)» 
The  standard  multiplication  technique  of  add-and-shift  can  be  used  to  perform 
the  squaring  operation  in  Eq.  (5~3)« 


Care  must  be  taken  during  the  calculation  of  in  Eq.  (5-3)  when  trunca- 
tion is  used  to  effect  division.  With  n = 256,  Eq.  ( 5~ 3)  can  be  rewrritten  in 


the  form 


(Xj_  - X )2  + (x2  - x)2  + ...  + (x256  - x )2 
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- 


or 

(X1  - x )2  (x2  - X )2  (x256  - X )2 

S2  ■ 25 5—*  25S ♦.-•••-  25S (5-6) 

Eqs . (5-5)  and  (5-6)  are  mathematically  equivalent,  hut  Eq.  (5-6)  cannot 
be  used  explicitly  for  small  signals  because  the  squared  terns  nay  be  less  than 
eight  bits.  If  the  squared  terms  are  divided  by  256  by  truncating  the  least 
significant  eight  bits,  the  resultant  variance  will  either  be  too  low  or  0. 
Therefore,  Eq.(5~5)  must  be  used  to  calculate  the  variance.  After  doing  the 
signal  averaging,  the  data  corresponding  to  one  is  to  be  stored  temporarily  in 
a register. 


two-dimb;sic::al  discrete  transform  processor 

Two-dimensional  transformation  for  the  data  of  the  pattern  processed 
earlier,  operates  on  N sample  values.  In  order  to  achieve  a reasonable  trans- 
mission speed  some  form  of  parallel  transformation  of  the  vector  series  is 
desirable.  Where  N is  small,  it  is  possible  to  duplicate  the  transform  hard- 
ware so  that  N separate  transform  systems,  each  operating  on  N samples,  can  be 
operated  simultaneously . This  gives  rise  to  some  redundancy  in  hardware  compo- 
nents and,  furthermore  has  the  disadvantage  that  the  transform  operation  can- 
not begin  until  the  entire  picture  has  been  scanned  and  digitized. 

* 

The  two-dimensional  finite  discrete  transform  of  a two-dimensional  array 
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X.  . of  N points  is  given  by 
1 J 
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Fig*  (5-5)  Machine  for  Parallel  A Transform 


where  A(N,N)  may  be  a WALSH/HAAR/SLANT/DISCRErE  COSINE  function  in  Discrete 


The  decomposition  of  Eq.  (5-7)  into  rows  and  columns  requires  the  complete 
matrix  f(u,v)  to  be  available. 


An  alternative  decomposition  theorm  is  suggested  (2)  based  on  the  matrix  repre- 
sentation for  the  two-dimensional  transform  given  by  the  equation 


where  A represents 


(5-8) 


an  N X N discrete  functions  WALSH/HAAR/SLANT/DISCRETE  COSINE  and  f N represents 

N X N set  of  data  values.  If  a^  = (aii  » ai2  ' aiN^  row  vec“ 

tor  of  the  matrix  A and  we  define  a column  vector,  F^  to  represent  the  product 
vector 


= fA  . A*  (5-9) 

| 

where  f.  is  the  ith  row  of  data  f„-.  as  (f.„  f, . f.0  flv,  .) 

1 NN  ' 10  il  12  IN  - 1 

then  the  transformation  of  fj^  will  be  given  by  the  sum  of  the  products  A . F^  . 

N - 1 

FNN  = ^ A • Fi  (5-10) 

1=0 

Thus,  the  process  of  two-dimensional  transformation  can  proceed  by  implementing 
three  distinct  steps. 


(1)  Each  pattern  line  which  has  been  scanned  and  digitized  is  multiplied  by  the 
discrete  function  component  to  form  a partial  transformation,  F^  . 

(2)  The  products  are  retained  in  an  accumulating  array  of  N elements  until  N 
such  products  have  been  stored  ( i . e . one  scan  line  has  been  processed) . 

The  vector  products  A . F^  are  thus  obtained  and  stored. 

(3)  Finally  the  elements  of  A.F^  axe  added  to  give  the  two-dimensional  trans- 
formation of  the  pattern  when  the  scanning  is  completed. 

The  important  feature  to  note  about  this  transformation  is  that  only  one 
line  is  operated  on,  at  a time  so  that  immediately  step  (l)  is  carried  out  and 
the  partial  transformed  vector  F^,  becomes  available,  it  can  be  copied  into  a 
store  for  step  (2)  leaving  the  store  for  step  (l)  vacant  to  hold  the  results 
of  the  next  line  calculation.  The  delay  in  processing  is  thus  reduced  from  N2 
elements  to  N elements,  constituting  one  line  of  the  pattern. 

The  following  machine  elaborates  the  implementation  of  the  transform  pro- 
cessing the  digitized  data. 

The  N2  pattern  samples  are  read  into  N2  holding  circuits.  The  values  of 
the  N2  elements  of  the  A matrix  are  permanently  connected  to  the  machine.  Each 
row  of  the  pattern  is  used  separately  and  all  rows  are  processed  simultaneously 
in  parallel.  The  N X N matrix  generated  from  processing  one  row  is  a partial 
discrete  transform  of  the  input  pattern  analyzed.  The  N partial  transforms  are 
added  together  in  parallel  to  give  the  N X N discrete  transformed  matrix.  Fig.- 
(5-5)  is  a block  diagram  of  the  machine  that  implements  parallel  ifALSH/HADAMARD/ 
HAAR/SLANT/  or  DISCRETE  COSINE  TRANSFORM. 

Input  to  the  n ^ machine  are  the  N2  pattern  elements  and  the  N2  elements 


of  the  A matrix.  The  outputs  of  the  machine  are  the  N vectors  F^  (each  has 
N components).  Fig.  (5-6)  shows  the  implementation  of  the  machine  [see 
Eqs.  (5~9)  and  (5-10)].  There  are  N outputs  from  the  II ^ machine,  each  one  is 
one  component  of  an  F^  vector.  The  machine  Fig.  (5-7)  performs  the  opera- 
tions A . for  i = 1,2,...,  N [see  Eq.(5~10)].  Fig.  (5-7)  shows  the  22 
machine  which  adds  the  respective  elements  of  the  output  partial  transforms 
from  the  H ^ machine  to  give  the  A transform  pattern.  Each  F(i, j)  output  of  the 
£ machine  Is  the  ijth  entry  of  the  A transformed  matrix. 
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The  n i machine  requires  (N  ).  N product  circuits  and  h summing  circuits. 

The  0 ^ machine  requires  (N^).  N product  circuits,  and  the  23  machine  requires 

2 2 2 
N summing  circuits.  Total  hardware  for  this  machine  is  2N  product  and  2N 

summing  circuits.  The  longest  time  required  to  get  the  complete  A transform  of 

any  pattern  would  be  t^  + t^  + t^  + = 2t^  + 2tp  where  t^  is  the  time  required 

to  generate  the  product  and  t^  the  time  required  to  generate  the  sum. 

For  binary  pictures  (where  each  picture  element  is  black  or  white),  one 
of  the  two  inputs  to  each  of  the  product  circuits  in  the  n ^ machine  is  a 
binary  0 or  1 . Thus,  the  N product  circuits  for  machine  would  be  inexpen- 
sive electronic  switches  that  are  closed  (give  a zero  output)  when  the  input 
from  the  pattern  is  zero,  and  open  (pass  through  the  values  of  +l)  only  when 

the  input  from  the  picture  is  1 . Then  the  hardware  required  for  this  machine 
3 3 2 

would  be  N switches,  N product  and  2N  summing  circuits,  and  the  longest  time 
required  to  obtain  the  complete  A transform  would  be  tg  + 2t^  + t^  where  t^  and 
t 2 are  as  before  and  tQ  is  the  time  required  to  set  the  switches  of  machine  fl  . 


The  inputs  to  the  machine  from  the  A matrix  can  be  permanently  connected 
and  each  pattern  element  corresponds  to  one  input  of  each  of  the  N product  cir- 


A > - *♦ 
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cults. 


As  seen  in  the  above  Figs.  (5“5»  5~6i  5“7)  only  one  row  of  sampled  digital 
values  axe  called  at  a time.  After  that  row  i is  processed,  the  partial  one 
dimensional  transforms  F^  corresponding  to  that  row  i are  stored  in  the  memory. 
The  data  register  is  free  now  to  handle  (i+l)th  row  data  samples.  Now  the  scan 
is  jumped  to  the  (i+l)th  row  of  the  pattern 


While  the  (i+l)th  row  data  samples  are  being  processed,  the  partial  two- 

dimensional  transforms  F.„  are  processed  and  stored  in  the  memory.  After  compu- 

in 

2 56 

ting  the  values  of  F^  for  i = 1 to  256  rows,  the  values  yT  F^  are  to  be 

i = 1 

summed  and  finally  stored  in  the  memory. 


Thus  through  the  decomposition  of  procedures,  the  transform  is  obtaimed  in 
a digital  computer  by  processing  one  line  of  the  pattern  at  a time. 


The  storage  requirements  to  obtain  the  A transform  of  am  N X N pattern 

2 

through  matrix  product  of  Eq.  (5-10)  are  i N storage  locations  for  the  pattern, 
the  algorithm  for  obtaining  the  entries  of  the  A transform  through  sub  matrices 
products  and  N^  storage  locations  for  the  resulting  transformed  matrix.  On  the 
other  hamd,  the  decomposition  procedure  requires  « N locations  for  one  line  of 
the  pattern  and  locations  for  the  transformed  matrix.  The  ratio  of  the  sto- 
rage requirements  is  given  by 


N + N 


2 

1 + 1/N 
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which  results  in  approximately  2*1  storage  reduction  as  the  dimensionality  of 
the  picture  increases  as  in  our  case. 

MEMORY  REQUIREMENT  3) » 

2 

A total  of  N + N memory  locations  axe  required  for  transform  operations 

p 

and  N for  output  storage.  For  carrying  out  the  transform  operation  in  WALSH/ 
HADAMARD/HAAR/SUNT/DISGRETE  COSINE  TRANSFORM  Domains  for  the  collected  data, 
the  total  memory  requirement  is  (N  + in  addition  to  the  N^  memory  loca- 

tions required  for  the  signal  averaging  alone. 

2 

The  total  minimum  memory  requirement  is  9N  + N.  For  N = 236,  this  demands 
a minimum  of  591  kilobits  of  storage  locations.  About  64k  X 16  ROM  memory  is 
required  for  storing  WALSH/HADAMARD/HAAR/SLANT/DISCRETE  cosine  Matrices  coeffi- 
cients for  transform  operations. 

An  additional  32k  X 16  ROM  memory  is  required  for  general  programming  data 
base  management  and  general  i/O  control  options.  The  total  ROM  memory  require- 
ment is  96k  X l6. 

The  32k  X 8 EPROM  chips  are  available  from  TEXAS  Instruments  in  the  market 
at  a competetive  price.  A total  of  6 chips  will  give  a total  96k  X 16  EPROM 
capacity  at  a price  of  $ 800. 

The  32k  X 8 EPROM  chip  TM2532  has  an  access  time  of  450n  second  and  is 
operable  with  + 5 volts  power  supply. 


The  transformed  data  has  to  be  stored  properly  in  RAM.  The  storage  re- 


quirements  for  the  WALSH/HADAMARD/HAAR/SLANT/DIS CRETE  COSINE  TRANSFORM  coeffi- 
cients is  48k  X 16  RAM  memory.  The  general  programming  and.  data  base  management 
for  I/O  terminals  requires  an  additional  l6k  X 16  RAM  memory.  The  total  RAM 
memory  requirement  is  64k  X 16  memory.  This  memory  is  available  by  Bulk  Bubble 
memories  by  Rockwell  International  or  Texas  Instruments  at  a competetive  price 
of  < $ 1000.  The  following  Fig  (5-8)  shows  the  general  structure  of  origaniza- 
tion  of  the  memory,  I/O  control,  Direct  Memory  Access  and  output  terminal  sys- 
tem. 


The  communication  between  various  I/O  terminals  can  be  achieved  through 
8288  bus  controller,  8259A  interrupt  controller  and  8089  Synchronous/Asynchro- 
nous controller  as  shown  below. 

APPROXIMATE  ESTIMATE  OF  THE  PROCESSING  TIME  OF  THE  2 DIMENSIONAL  TRANSFORMS  i 

There  are  32  Multipliers  of  type  MPY  16HJ  (l6  bits  by  16  bits)  by  TRW  for 
the  parallel  processing  of  partial  transform  of  the  ith  row.  Each  Multiplier 
takes  80nsec.  for  computation.  The  total  time  that  is  required  for  Multiplier 
is  2.56>»sec. 

There  are  128  Adders  of  Type  Am  25  LS  15  (4  bits  by  4 bits)  of  AMD  for 
parellel  processing  of  partial  transformation  of  ith  row.  Each  Adder  takes 
25nsec.  The  total  time  that  is  required  for  adding  is  3-2psec. 

This  means  for  processing  the  partial  transforms  corresponding  to  the  ith 
row  alone  is  3-2  + 2.56  = 5 -76m sec.  There  are  256  rows.  Hence  the  entire  com- 
putation time  for  a two-dimensional  transform  is  1.47 5msec. 


l/@  Arrangement  of  Z8000 


The  signal  averaging  of  the  (i  + l)th  row  must  be  done  before  the  partial 
transforms  of  the  ith  row  axe  processed.  For  accessing  and  addressing  the  ith 
row  of  the  Data,  Transform  Coefficients,  Multiplication,  Addition  and  other 
associated  addressing  modes  will  involve  less  than  1msec.  as  the  instruction 
time  for  Direct/indirect  Addressing  is  of  the  order  1msec.  The  Access  times 
for  RAM  and  ROM  memory  is  typically  of  the  order  of  less  than  lrasec.  with  the 
lateset  technology  available  today  (typically  around  450  nsec.). 

Hence  to  do  the  two-dimensional  transforms  in  WAISH/HADAMARD/HAAR/SLANT/ 
DISCRETE  COSINE  TRANSFORM  domains,  in  all  may  involve  a total  computation  time 
of  4 X 2 = 8msec, 

For  documenting  the  results  a line  printer  is  required  with  a RS  232  in 
terface  capable  of  300  baud  serial  transmission. 

INTERACTIVE  RASTER  SCAN  SYSTEM 

To  display  the  transforms  an  interactive  Raster  Scan  system  is  required. 
This  system  must  be  capable  of  accepting  formatted  data  and  computer  generated 
images  and  should  be  able  to  display  on  a Visual  screen  such  as  a Black  and 
White  or  color  TV  Screen. 

The  typical  specifications  for  such  a system  are« 

VIDEO  OUTPUT 1 EIA  composite  sync  and  blanking , 60Hz  vertical  scan  rate}  15. 57KHz 
horizontal}  0 to  -1  Volt  into  75  Ohmsi  microprogram  selectable  interlaced/non- 
interlaced displays  Scan  rate  can  also  be  set  to  25,  30  or  50Hz. 
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ALPHANUMERIC  CHARACTER  GENERATION*  8X8  Dot 


MATRIX*  512  X 512*  64  lines  at  85  characters 

Standard  64  character  ASCII  upper/lower  case  font. 

CURSOR*  Scroll/Zoom/Winking 

GRAYSCALE*  16  Gamma-Corrected  levels  from  256  level  look-up  table 

COLOR j 16  preassigned  colors  to  a standard  RGB  Color  Monitor 

COLOR  LOOK-UP  TABLE*  Optional  look-up  table;  maps  4 Intensity  bits  to  three 

4-bit  video  levels  for  standard  RGB  Color  Monitor 

Up  to  l6  out  of  4096  colors  may  appear  at  one  time  . 

IMAGE  MEMORY*  Up  to  32K  bytes  of  MOS  dynamic  memory.  Various  memory  segmenta- 
tions are  possible  under  software  control 


STANDARD  CONFIGURATIONS 


Pixels 


Bit(s) 


« 


B & W monochrome 


l6-level  gray  or  color 


1024  X 1024 

521  x 521 


DATA  UPDATE*  Either  over  DMA  or  Command  I/O 

INPUT  DEVICE  OPTIONSj  Joystick,  Trackball,  Keyboard  and  Light-Pen 

PIXEL  UPDATE  TIMES*  Based  on  average  times  to  update  Image  memory  from  new 


data  in  input  buffer 


16  pixel  update* 
Simultaneously* 


Sequential  Access ; 

45  nsec/monochrome  pixel 
140  nsec/grayscale  pixel 
Random  Access ; 

90  nsec/monochrome  pixel 
190  nsec/ gray scale  pixel 
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Single  pixel: 
Update  Mode: 


Random  Access: 


800  nsec/monochrome  pixel 
2 M sec/grayscale  pixel 

Changeover:  1.2  Msec 

COMPATIBILITY:  Z 8000  Bus 

DATA  TRANSFER  RATE:  Better  than  2 Megabites  per  second  from  host  computer 

The  Fig.  (5-9)  shows  the  system  conf Iguration  for  a typical  Raster  Display 
Unit  meeting  with  the  above  specifications.  The  following  are  some  of  the  manu- 
factures supplying  Raster  scan  Display  systems: 

(1)  Lexidata  corp.  215  Middle  Sex  Turnpike,  Burlington,  Massachusetts  - 
01803,  Tel:  (6l?)  - 273-2?00 

(2)  Matrox  Electronic  Systems  Ltd. , 2795  Bates  Rd,  Montreal,  Quebec, 

H3s  1B5,  Canada 

(3) R  Raratek  Corporation,  585  N.  Mary  Ave,  Sunnevale,  California  9^085 

Tel:  (408)  - 735  - 8W>0 

(4)  Electronic  Image  Systems,  2800  Indian  Ripple  Rd,  Dayton,  Ohio  45440, 
Tel::  (513)  - 426  - 6000 

(5)  Grinnel  Systems,  2986  Scoll  Boulevard,  Santa  Clara,  California  95050t 
Tel:  (408)  - 988  - 2100. 

(6)  Comtal  Image  ProceaaMuNip*tems,  P.0  Box  5087,  Pasadana,  California  91107 
Tel:  (213)  - 793  - 2134. 

(7)  Megatek  Corp.  3931  Sorrento  Valley  Blvd.  San  Diego,  California  92121 


Fig-  (5-9)  A System  Configuration  including  options 
(Courtesy  of  Lexidata  Corp.) 
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The  approximate  cost  of  such  a system  for  Black  and  White  display  is 
approximately  $ 3»500  and  for  the  color  display  approximately  $ 5000. 

APPROXIMATE  ESTIMATE  OF  THE  COMPONENTS  OF  THE  SYSTEM 


k 


% 


(1)  Imaging  system 

(2)  Sampler  and  A/D  converter  to  be  compa table  with  Z8000  GPU 

(3)  MICROPROCESSOR  Z8000  Mother  board  and  the  associated  i/O 
controllers 

(4)  96k  X l6  EPROM  memory  board  stacked  in  terms  of  6 pieces  of 
32k  X 8 EFRCM  - TM  2532  chips  and  buffers  to  communicate  with 

Z8000 

(5)  64k  X 16  RAM  memory  board  with  buffers  to  be  compatible 
with  Z8000  making  use  of  bubble  memories 

(6)  Arithmetic  logic  units  to  do  fast  multiplications  16  bits 
by  16  bits  and  summers  to  do  partial  sums  of  the  transforms 
TRW's  MFI  - 16HJ  multiplieri  # 32 

AMD's  Am  25  LS  15  Summers:  # 128 
(?)  Black  and  White  Raster  Scan  display 
(8)  Hard  copy  printer  with  RS  232  interface 


$ 1000 
$ 1200 

$ 1000 

$ 800 

$ 1000 


$ 57  00 
$ 3500 
$ 900 


Total  $ 15,100 


With  the  decreasing  cost  of  Microprocessors,  memory  boards,  LSI  ALU  chips 
and  display  systems,  by  1980  the  total  cost  estimate  of  the  entire  system  may 
be  around  $ 12,000. 
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CONCLUSIONS 


In  this  report,  we  tried  to  make  an  investigation  of  linear  transformations 
which  may  prove  useful  for  automatic  cartographic  analysis  and  feature  extrac- 
tion. For  feature  extraction  in  the  above  sense,  we  have  to  do  two  things, 

(l)  select  and  classify  the  standard  features  usually  present  in  photographs, 
and  (2)  select  analytically  the  transform  or  transforms  which  would  appear  to 
have  the  most  success  in  enhancing  the  identification  of  the  objects  to  be  de- 
tected. 


The  selection  and  classification  of  cartographic  features  was  done  in  chap- 
ter one.  This  classification  was  based  on  catogorizing  the  different  features 
into  three  main  groups,  point  features , line  features  and  area  features  and  is 
repeated  in  table  (6-1).  The  classification  of  cartographic  features  can  also 
be  done  according  to  some  other  classifiers.  Table  (6-II)  groups  the  features 
into  two  categories.  In  the  first  column,  the  natural  features  and  in  the  second 
column  those  features  which  are  man-made  are  shown.  Man-made  features  usually 
have  some  neat  geometrical  shape  such  as  straightness  (e.g.,  power  lines)  para- 
llelism (e.g.,  dual  highways),  periodicity  (e.g.,  grounded  aircrafts),  etc.  and 
even  sometimes  there  can  be  some  a priori  information  about  their  structure 
(e.g., highway  clover  leaf).  Natural  features  are  usually  of  irregular  shape  and 
their  geometry  cannot  be  predicted.  Table  (6-III)  categorizes  the  cartographic 
features  according  to  their  regularity  or  irregularity  in  shape. 


The  second  step  of  transforms  technique  in  feature  extraction  is  to  choose 
the  most  suitable  transform,  if  any,  for  bringing  out  or  mhancing  a specific 
feature.  To  do  this,  we  must  have  a good  knowledge  about  the  analytical  proper- 
ties of  different  transforms.  In  chapter  two  and  three,  different  continuous  and 
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CARTOGRAPHIC  FEATURES 

Point  features 

Line  features 

Area  features 

Isolated  buildings 

Rivers  and  streams  with  water 

Large  rivers 

Storage  tanks 

Drainage  channels  without  water 

Lakes 

Quarry  or  borrow  pit 

Canals 

Wash 

Tunnel  entrance 

Dual  highways 

Forest 

Primary  Roads 

Scrub 

Unpaved  roads  and  trails 

Marsh  and  Swamp 

Transmission  lines 

Mangrove 

Pipe  lines 

Orchard  and  vineyard 

Levees 

Urban  area 

Dam 

Suburban  area 

Rapids  and  falls 

Industrial  area 

Bridges 

Railroad  yard 

Shoreline 

Cemetery 

Airport 

Table  (6  - i)  : Categorization  of  Cartographic  features  according  to  their  geometry 
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CARTOGRAPHIC  FEATURES 

Natural 

i 

Main-Made 

i 

Rivers  and  streams 

Isolated  buildings 

Unpaved  roads  and  trails 

Storage  tanks 

Rapids  and  falls 

Quarry  or  borrow  pit 

Shoreline 

Tunnel  entrance 

Large  rivers 

Canals 

Lakes 

Dual  Highways 

Forests 

Primary  roads 

Scrub 

Secondary  roads 

Marsh  and  swamp 

Transmission  lines 

Pipe  lines 

Levees 

Dam 

Bridges 


Airport 


\ 

l 


Orchard  and  vineyard 
Urban  area 
Suburban  area 
Industrial  area 
Railroad  area 
Cemetery 


Table  (6-Il)i  Categorization  of  cartographic  features  according  to  their  nature. 


. Ml 


Table  (6-III)  t Categorization  of  cartographic  features  according  to  their 
regularity  in  shape.  _ — 


CARTOGRAPHIC  FEATURES 

Neat  Geometrical  shape 

Irregular  Shape 

Isolated  buildings 

Rivers  and  Streams 

Storage  tanks 

Unpaved  road  and  trails 

Tunnel  entrance 

Rapids  and  falls 

Canals 

Shoreline 

Dual  highways 

Large  rivers 

Primary  roads 

Lakes 

Transmission  lines 

Scrub 

Pipe  lines 

Marsh  and  swamp 

Dam 

Quarry  or  borrow  pit 

Bridges 

Secondary  roads 

Airport 

Levees 

Orchard  and  vineyard 

Drainage  channel 

Urban  area 

Suburban  area 

Railroad  yard 

Industrial  area 

Cemetery 

discrete  transforms  were  studied.  Among  continuous  transforms,  Fourier  and  Hankel 
transform,  and  among  discrete  transforms,  Hadamard,  Haar,  Slant  and  Discrete  * . 

Cosine  transform  were  selected.  Two-dimensional  matched  filtering  and  also  some 
new  extrapolation  techniques  were  discussed  in  the  context  of  continuous  trans- 
forms . 

Another  important  method  in  feature  extraction,  namely  line  and  curve  detec- 
tion was  not  discussed  in  this  report  because  it  was  not  in  the  scope  of  our 
transforms  study.  It  can  be  found  in  the  literature  under  picture  processing 
techniques.  Table  (6-IV)  gives  a list  of  different  techniques  for  feature  extrac- 
tion. 

In  studying  different  transforms,  emphasis  was  put  on  two  things  t 

(a)  How  a particular  transform  behaves  as  the  input  object  image  is  trans- 
lated, rotated  or  scaled.  This  is  a very  useful  consideration  because  usually 
there  is  no  a priori  information  about  the  location,  orientation  or  size  of  an 

object  image  in  a picture.  Morever,  since  the  size  of  an  object  image  depends  on 
the  distance  from  which  a picture  has  been  taken,  it  will  be  advantageous  to 
know  the  response  of  a transform  to  the  scaled  versions  of  an  object.  In  Table 
(6-V)  we  have  summarized  the  translational, rotational  and  size  invariance  pro- 
perties of  the  transforms  studied. 

I 

(b)  How  powerful  is  a particular  transform  in  bringing  out  a specific  fea- 
ture ? To  answer  this  question  some  candidate  features  were  selected  and  differ- 
ent transforms  were  applied  to  those  features.  Based  on  these  examples  and  also 

* 

based  on  analytical  properties  of  the  transformations,  some  general  remarks  can 
be  made.  It  should  be  noted  that  all  our  conclusions  are  based  on  the  observa- 

(141) 


w 


I 


Two-dimensional  linear 

transforms 

Other  techniques 

Fourier 

Matched  filtering 

Hankel 

Hadamard 

Extrapolation  techniques 

Haar 

Slant 

Line  and  curve  detection  * 

Discrete  cosine 

Table  (6-IV)  : A list  of  different  techniques  for  feature  extraction 
* Not  discussed  in  this  report 


Techniques 

Translation 

Invariant 

Rotation 

Invariant 

Size 

Invariant 

Fourier  transform 

O.K.  (spectrum) 

The  spectrum  rotates 

The  spectrum  is  scaled 

Hankel  transform 

O.K 

O.K 

The  spectrum  is  scaled 

Hadamard  transform 

NO 

NO 

NO 

Haar  transform 

NO 

NO 

NO 

Slant  transform 

NO 

NO 

NO 

Discrete  Cosine 
transform 

NO 

NO 

NO 

Matched  filtering 

O.K 

Can  be  modified  to  do 

so 

Can  be  modified  to  do 
so 

Extrapolation 

techniques 

O.K 

O.K 

O.K 

Table  (6-V)  t The  behaviour  of  different  feature  extraction  techniques. 
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tions  we  have  made  and  so  they  will  he  judgements  to  best  of  our  knowledge.  We 
do  not  guarantee  that  these  conclusions  axe  the  most  accurate  ones  because  we 
have  not  made  an  exhaustive  search  for  all  cartographic  features.  This  ia  a 
caveat . 


Continuous  transformations  like  Fourier  and  Kankel  transform  give  a very 
neat  analytical  result  when  applied  to  the  selected  features. 

The  Fourier  transform,  because  of  the  nature  of  its  kernel,  appears  to  be 
the  most  powerful  transform  in  bringing  out  periodic  structures  such  as  parallel 
streets . 

Hankel  transform  seems  very  useful  in  bringing  out  features  with  circular 
symmetry  such  as  storage  containers. 

The  two-dinensional  matched  filtering,  seems  the  most  powerful  means  of  fea- 
ture extractidn  provided  some  a priori  information  is  known  about  the  shape  of 
the  feature  to  be  detected  (e.g.  airport)  . If  we  know  exactly  what  we  are  look- 
ing for  in  a picture,  we  can  build  a suitable  matched  filter  for  detecting  that 
particular  feature.  We  saw  that  the  matohed  filter  can  be  modified  to  detect 
scaled  versions  or  rotated  versions  of  an  object.  Conceptually  matched  filter- 
ing c?n  be  performed  easily  using  a digital  computer. 

In  the  case  of  discrete  transforms,  it  was  observed  that  in  almost  all 
of  the  cases  of  the  cartographic  features,  the  Walsh  Hadamard  transforms  have 
a decided  advantage  over  the  other  discrete  transforms.  Only  in  a couple  of  cases, 
are  Slant  transforms  promising  when  compared  to  the  others.  Even  the  Slant  edges 
may  be  detected  by  WHTs  if  proper  threshold  filtering  or  zonal  filtering  techni- 


i 
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In  addition,  the  VHT  offers  super  fast  computational  flexibility  when  com- 
pared to  the  whole  set  of  the  other  transforms  either  discrete  or  continuous. 
Such  fast  WHT  algorithms  are  available  extensively  in  the  literature.  Besides  in 
real  time  applications,  the  hardware  realization  can  be  done  with  minimum  storage 
requirements  and  investment.  Such  fast  WHT  processors  axe  already  available  in 
the  market  . 

Another  subject  which  was  discussed  in  this  report,  relates  to  the  extra- 
polation of  bandlimited  signals.  We  proposed  that  extrapolation  techniques  can 
be  used  to  enhance  a part  of  a picture  degraded  by  noise.  This  idea  may  seem  a 
little  ambitious,  but  we  believe  that  this  technique  may  be  used  to  bring  out 
features  in  one  part  of  a picture,  from  the  information  conveyed  by  the  rest  of 
that  picture.  More  work  has  to  be  done  on  this  subject. 

RECOMMENDATIONS 


The  main  purpose  of  the  work  done  on  this  report  has  been  to  study  picture 
transformations,  to  illustrate  their  analytical  properties  and  to  compare  diffe- 
rent characteristics  of  various  transforms.  However,  thoughout  this  paper  we 
tried  to  illustrate  which  picture  transformations  are  the  most  suitable  ones  for 
bringing  out  a specific  cartographic  feature.  The  selection  of  condidate  fea- 
tures, using  examples  and  applying  different  transforms  to  those  examples  all 
have  been  in  the  scope  of  accomplishing  the  above  task.  In  the  first  section  of 
this  chapter,  we  made  some  conclusions  based  on  our  studies.  It  was  pointed  out 
that  those  conclusions  are  best  to  our  knowledge  and  cannot  be  taken  as  the 
ultimate  words. 
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In  this  part,  we  like  to  give  some  recommendations  about  using  different 
transforms  in  extracting  different  cartographic  features.  This  task  is  not  sim- 
ple and  our  judgements  may  look  somehow  ambiguous  at  different  points.  However, 
we  do  not  claim  that  these  recommendations  are  the  most  accurate  ones.  They  are 
based  on  our  observations  and  our  observations  could  not  be  exhaustive.  Much 
more  study  is  needed  on  this  subject  to  bring  more  plausible  results. 

In  Table  (6-Vl),  a general  recommendation  is  given  on  the  use  of  different 
feature  extraction  techniques.  Our  judgement  for  this  recommendation  is  based  on: 

(a)  since  all  picture  transforms  have  some  well  defined  mathematical  base 
functions,  or  kernels,  they  are  most  suitable  for  extracting  features  with  neat 
geometrical  shape.  For  example,  the  Hankel  transform  with  the  Bessel  function  in 
its  kernel  is  very  powerful  in  bringing  out  circular  object  images. 

(b)  when  a man-made  cartographic  feature  has  an  irregular  shape,  using 
image  transforms  will  not  be  much  effective.  However  there  is  usually  an  a 
priori  information  about  the  shape  or  geometry  of  that  object.  A highway  clover 
leaf  is  an  example  of  this  kind.  If  such  information  is  available,  then  a re- 
plica of  that  object  image  can  be  used  to  build  a matched  filter  for  detecting 
the  object. 

(c)  Cartographic  features  which  are  usually  observed  in  nature,  such  as 

rivers,  caves,  etc.,  neither  have  a neat  geometrical  shape  nor  is  there  any  a 
priori  information  about  their  structure.  The  use  of  picture  transforms  or  mat- 
ched filter  do  not  seem  very  useful  in  this  case.  It  appears  that  line  and  curve 
detection  techniques  are  the  best  methods  to  enhance  them  . 
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In  Table  (6-VII),  we  have  given  detailed  recommendations  for  the  use  of 
different  techniques  in  extracting  various  cartographic  features.  For  each  car- 
tographic feature,  we  have  recommended  three  different  transforms  with  three 
different  ratings.  Again  we  mention  that  these  recommendations  are  based  only 
on  our  observation  and  in  no  way  should  be  considered  as  the  ultimate  truth. 

Our  judgements  in  making  up  Table  (6-VII)  are  based  on  : 

(1)  The  examples  used  in  chapter  two  and  three  and  the  computer  results  in 
chapter  four. 

(2)  The  general  recommendation  given  by  Table  (6-Vl). 

(3)  The  power  of  Fourier  transform  in  bringing  out  periodic  structures. 

(4)  The  power  of  Hankel  transform  in  bringing  out  images  with  circular 
symmetry . 

(5)  The  power  of  Hadamard  transform  in  bringing  out  lines. 

(6)  The  power  of  Slant  transform  in  bringing  out  shaded  edges. 

(7)  The  power  of  Matched  filters  in  bringing  out  images  with  known  shapes. 

(8)  The  power  of  Line  and  curve  detection  techniques  in  bringing  out  curva- 

tures. 

(9)  In  the  case  of  any  tie  between  using  a transform  or  the  matched  filter, 
preference  is  given  to  the  transform  because  of  its  simpler  applicability. 

(10)  In  the  case  of  any  tie  between  using  a transform  or  curve  detection  techni- 
ques preference  i3  given  to  the  transform  because  this  paper  is  devoted 

to  transforms  study. 

(11)  In  the  case  of  any  tie  between  two  transforms,  preference  has  been  given 
to  the  transform  with  more  computational  simplicity. 
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Haar 

Slant 

Discrete 

Matched 

Line  and 

Cosine 

filter-  1 

curve  de- 

trans. 

trans. 

trans 

ing 

tection 

Table  (6-VIl)i  A detailed  recommendation  for  the  use  of  different  techniques 


in  extracting  different  patterns. 

A = Good  B = Satisfactory  C = Fair 


RECOMMENDATIONS  FOR  FUTURE  STUDY 

The  ultimate  objective  of  realizing  a completely  automatic  and  computer- 
ized picture  reading  machine  for  cartographic  analysis  is  a long  way  off.  More 
likely  to  be  realized  within  the  decade  is  a hybrid  man-machine  device  which 
would  use  feature  extraction  and  object  identification  to  aid  cartographic 
readers  to  reduce  the  tedium  and  increase  productivity  and  accuracy.  It  is  there- 
fore necessary  to  focus  further  research  effort  at  those  problems  which  are  like- 
ly to  achieve  success  in  this  objective.  It  appears  that  at  this  time,  the 
approach  would  be  to  create  a system  which  does  cartographic  analysis  as  follows: 


FUNCTION  SYSTEM 

(a)  Selection  of  general  category,  man-made  Human 

or  natural 

(b)  Based  on  selection  (a)  apply  appro-  Computer 

periate  signal  processing  algorithms 

(Transforms,  edge  detection,  etc) 

(c)  Based  on  results  of  (b)  interactively  Human-Computer  interac- 

ad just  parameters  tion 


(d)  When  human  is  satisfied,  select  Computer 

features  of  highest  significance 

(e)  Present  cartographic  objects  of  interest  Computer 

in  sharp  contrast  and  give  confidence 

levels 

(f)  Examine  more  carefully  those  objects  Human 


(g)  Repeat  (a)  for  another  category 


Human 


To  accomplish  this  objective,  it  will  be  necessary  not  only  to  further 
explore  signal,  processing  techniques  and  the  technalogy  to  support  it,  but  also 
to  develop  adequate  decision  rules  specifically  for  discriminating  cartographic 
objects  of  interest.  This  will  require  study  of  s 

(l)  Algorithms  of  clustering  and  recognition  of  cartographically  interesting 
objects.  This  will  include  more  refined  feature  definition  and  detection. 

(2)  Statistical  analysis  to  determine  confidence  levels. 

(3)  The  generation  of  software  and  possibly  a specialized  high-level  language. 

(4)  Investigation  of  specialized  hardware  signal  processes  for  increased  speed. 
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APPENDIX  1 


SCALED  VERSIONS  OF  A TWO-DIMENSIONAL  FUNCTION 


Let  f(x,y)  be  a curve  in  the  x-y  plane  whose  corresponding  equation  in 
polar  coordinates  is  given  by  P ( p , d)  • If  this  curve  has  a closed  contour 


we  assume  that  the  origin  of  the  x-y  plane  is  located  inside  the  contour.  A 
scaled  version  of  P ( P , o)  is  given  by  P(kp  , 0 ) in  which  k is  the  scaling 


the  scaled  version  of  f(x,y)  can  be  obtained  by  putting 


and  thu3  getting  f(kx,ky)  as  the  scaled  version  of  f(x,y).  If  , th the 

curve  is  magnified  . For  k >>  1 , the  reduced  scaled  version  of  the  curve  is  ob 


tained.  The  figure  below 


hows  a function  and  its  scaled  versions 


Fig  (1-1)  Scaled  versions  of  a function 


APPENDIX  II 


PROLATE  SPHEROIDAL  WAVE  FUNCTIONS 


Prolate  Spheroidal  Wave  Functions  are  a set  of  functions  with  the  following 
properties  s 

(i)  The  set  of  functions  is  countably  infinite  and  all  the  functions  are 

real. 

(ii)  The  V^(t)  are  complete,  bandlimited  and  orthonormal  on  the  real 
line,  i.e., 


+ 00 

J ^(t.c) 

- oo 


*j(t,c)  dt  = 


u 

1 


j 

j 


(II-D 


c is  a constant  related  to  the  cutoff  frequency  and  the  known  interval  T, 


c = 


11 T 


(II -2) 


(iii)  In  the  interval  \tl  T , the  'P.(t)  are  complete  and  orthogonal  i.e., 

2 1 


♦f 


J\{ t,c)  *(t,c)  dt  = 


i*  j 

J 


(H-3) 


(iv)  The  functions  <f^(t)  are  obtained  from  the  following  integral  equation, 


♦ I 

2 


\(c)  = Ain  ll,(t  - s)  „ (SfC)  da  (II-4) 

J n(t  - s)  x 


T 

2 


(163) 


in  which  A(c)'s  are  reed  positive  numbers  (eigenvalues)  so  that 


x0  > X1  > X2  > * ’ * (II-5) 


and  each  eigenvalue  ^(c)  corresponds  to  an  eigenfunction  ^(t.c) 


There  exists  no  exact  analytic  expression  for  Prolate  Spheroidal  functions, 
but  these  functions  have  been  tabulated.  In  Fig.(ll-l)  the  Prolate  Spheroidal 
functions  for  c = 1,  2 and  4 are  shown.  Table  (l)  gives  the  values  of  A (c). 


c = 0.5 


=1.0  c = 2.0 


c = 4.0  c = 8.0 
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Computer  Program 
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